METHODS OF SURVEYING FOR CC (BETA) CHEMOKINE
RECEPTOR VARIANTS AND THEIR ASSOCIATION WITH HIV-1
TRANSMISSION AND/OR DISEASE PROGRESSION

BACKGROUND OF THE INVENTION 

Cross-Reference to Related Application

This non-provisional patent application claims benefit of
provisional patent application U.S. Serial number 60/148,530,
filed August 12, 1999, now abandoned.

Federal Funding Legend

This invention was produced in part using funds obtained
through grants AI41951, AI40591 and AI42454 from NIAID, and grant
DA04347 from NIDA. Consequently, the federal government has
certain rights in this invention.

Field of the Invention

The present invention relates generally to genotyping
CCR alleles in individuals and populations. More specifically, the
present invention relates to methods of correlating CCR alleles
and/or genotypes, specifically CCR5 promoter alleles, with HIV-1
transmission and/or disease progression.

Description of the Related Art

"Chemokine" describes a closely related family of
"chemotactic cytokines" with conserved sequences, known to be
potent attractors for various leukocyte subsets such as
neutrophils, monocytes, or lymphocytes. Chemokines are a large
superfamily consisting of four subfamilies that display between
two and four highly conserved NH2-terminal cysteine amino acid
residues. The CXC (or α) family has the first two NH2-terminal
cysteines separated by one nonconserved amino acid residue. In
contrast, the CC (or β) family has these cysteines in juxtaposition,
and the C (or γ) family has one lone NH2-terminal cysteine residue,
while the CX3C (or δ) family has these cysteines separated by three
intervening amino acids. The large number of chemokines and
their receptors, together with the expression of chemokine
receptors on cells other than leukocytes (such as epithelial,
endothelial and smooth muscle cells), is indicative of the
importance of these molecules.

Six receptors for the CC family of chemokines have
been identified: CC-CCR1, CC-CCR2, CC-CCR3, CC-CCR4, CC-CCR5
and the Duffy blood group antigen. These proteins are seven
transmembrane domain G protein-coupled receptors. In general,
there is broad overlap in the ligands bound by the CC chemokine
receptors. The human CCR5 and CCR2 chemokine receptor genes,
which serve as co-receptors with CD4 for HIV, are tightly linked on
chromosome 3p21-22, separated by 20 kb.

The complex mechanisms for HIV-1 entry into human
CD4+ cells reflect in part its evolving usage of co-receptors (35).
For example, macrophage-tropic HIV-1 isolates infect cells by
binding to the CD4 receptor and the CCR5 co-receptor (36), while
T-cell-tropic viruses mainly use CXCR4 as the co-receptor (37).
HIV-1 represents a close relative of simian immunodeficiency virus
(SIV) that is naturally found in African primates, including
chimpanzees and sooty mangabeys (15). Recent studies have
confirmed that HIV-1 originated in the chimpanzee subspecies Pan
troglodytes troglodytes from Central West Africa (16). Both HIV
and SIV prefer CCR5 as their co-receptor for penetrating cells
(17-19), which suggests that CCR5 is commonly expressed in both
human and non-human primates. Therefore, highly conserved
transcription factor binding elements may represent a critical
mechanism for regulating transcription of the CCR5 gene, and may
serve as appropriate targets for intervention experiments.
Additionally, other members of the CCR co-receptor gene family
also facilitate viral transmission (38), but they appear to have
more restricted cellular distribution (19,39-40).

Recent work in AIDS cohorts has revealed associations
between several polymorphisms in the CC (beta) chemokine
receptor loci and either variable degrees of protection against
HIV-1 transmission or variable evolution of the AIDS that follows
(1-4). In Caucasians, homozygosity for a 32-bp deletion (Δ32)
mutation in the CCR5 coding sequence leads to the absence of cell
surface expression of CCR5, which confers nearly complete
resistance to HIV-1 infection (1,41-44). Another less frequent
mutation at nucleotide position 303 of the CCR5 coding sequence
(m303) introduces a premature stop codon that also abolishes
surface expression of CCR5 in Caucasians. Accordingly, an
individual carrying both CCR5-Δ32 and m303 is resistant to HIV-1
infection (45). Carriage of a single copy of CCR5-Δ32 in the
presence of CCR5 wild-type provides little if any resistance to
HIV-1 infection, but does tend to delay the progression of disease
(1,4,46), possibly through interference with translocation of the
wild-type product (47). Neither the CCR5-Δ32 nor the CCR5-m303
allele is frequent in African and Asian populations (1,43,48).
However, African and Asian ethnic groups also show great
variability in acquisition and progression of the disease.
Population-specific polymorphisms within the CCR5 gene may exist
(49), but none of these have been associated with varying degrees
of viral susceptibility (48). The use of CCR5 as the co-receptor is
not altered whether or not additional CCR5 polymorphisms are
present, regardless of the HIV-1 subtype (50).

In addition, extensive heterogeneities have been
described in the expression and splicing of CCR5, apparently
regulated by different CCR5 promoter alleles (3). More
importantly, 2 CCR5 promoter variants have been associated with
contrasting rates of HIV-1 disease progression (5,6). Expression of
cell surface CCR5 also varies widely, even in individuals
homozygous for wild type CCR5 (51,52), indicating differential
regulation of CCR5 production. Therefore, variability in infectivity
of primary HIV-1 isolates that prefer CCR5 as the co-receptor is
observed even in wild type individuals. Detailed study of the
organization of the CCR5 gene and its promoter has uncovered
extensive heterogeneity in the 5' untranslated region (UTR) of
CCR5 mRNA (13). Some of these polymorphisms may be linked to
changes in splicing and may account for continuing uncertainties
about the intron and exon boundaries (13-14,53). One CCR5
promoter variant (59653T) shows strong linkage disequilibrium
with CCR2b-64I (2), a CCR variant previously associated with
retarded disease progression (4), and subsequent work has further
detailed the relationships of CCR5 promoter markers to both
CCR2b and CCR5-Δ32 (3,5,6). More recent association of the
59029G/G genotype with reduced promoter activity (5) also
suggests that CCR5 promoter polymorphisms independently
modulate HIV-1 disease progression.

The mechanisms and credibility of these specific
genetic associations are under debate (3,10), especially because
some of the CCR variants are tightly linked to each other (2).
However, these findings highlight the importance of host genetic
factors in HIV/AIDS and may guide development of new measures
for the prevention and control of HIV-1-related diseases (7-9).

Polymorphisms at the CCR2b, CCR5, and CCR5 promoter
loci have been analyzed in four ethnic groups, with a special
emphasis on the relationships of CCR5 promoter allelic variants to
other well-characterized markers previously associated with
different outcomes of HIV-1 transmission and disease progression.
In addition, the same CCR5 promoter polymorphisms were
analyzed in HIV-1-infected Rwandan women, with a special
emphasis on the influence of the genetic polymorphisms on HIV-1
disease progression.

The prior art is deficient in methods used to predict
the likelihood and/or probability of HIV-1 transmission and/or
disease progression based upon CCR alleles and/or genotypes. The
present invention fulfills a long-standing need for the development
of a rapid and informative genotyping strategy that can be readily
applied to analyze CCR2, CCR5 and related genetic variants and to
evaluate the relationship of each genotype to HIV transmission
and disease progression.

SUMMARY OF THE INVENTION

Variability in HIV-1 infection has been associated with
genetic variants in the beta-chemokine receptor 5 (CCR5) locus.
Genetic variations (a 32-bp deletion and a point mutation) in the
coding sequence of the HIV-1 co-receptor, CCR5, have been shown
to confer resistance to HIV-1 infection by depleting CCR5
expression on the cell surface. While CCR5 coding sequences have
exhibited relatively limited variation, the CCR5 promoter sequence
appears polymorphic in all major populations.

The studies reported herein revealed five major CCR5
promoter alleles with distributions that differed widely among the
four distinct ethnic groups examined. Herein, the methods of the
instant invention are used to present evidence that particular
genetic variants of the CCR5 promoter appear to determine the
infectability of an individual (i.e., herein, of heterosexual women
in Kigali, Rwanda, and injecting drug users in New York, USA).
Additionally, the present methods have been used to examine the
relationship between the major CCR5 promoter genotypes and
HIV-1 to AIDS disease progression (i.e., among 201 HIV-1-infected
Rwandan women). The effects of disease progression-related
CCR2b and CCR5 polymorphisms on early HIV-1 viral load were
also determined in a cohort of homosexual HIV-1 seroconverters.
The methods of the present invention allowed the inventors to
establish an independent and strong linkage between HIV-1
transmission and CCR5 promoter alleles using these distinct
cohorts and further, to validate the methodology of the present
invention.

One object of the present invention is to provide
methods of correlating CCR alleles and/or genotypes, specifically
CCR5 promoter alleles, with HIV-1 transmission and/or disease
progression.

In an embodiment of the present invention, there is
provided a method of surveying CCR genotypes in a population,
comprising the steps of: (a) obtaining biological samples from a
representative number of individuals in a population, with each
sample being from a different individual, wherein the sample
contains genomic DNA; (b) combining a portion of each sample
with at least one experimental primer combination and a control
primer combination, thereby producing primer-annealed DNA,
wherein the experimental primer combinations are selected from
the group consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8,
5 & 9, 5 & 10, 5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13,
14 & 16 and 15 & 16, wherein the control primer combination is SEQ
ID Nos. 17 & 18; (c) amplifying the primer-annealed DNA in a
reaction, thereby producing amplicons, wherein reaction
conditions for the amplification are optimized for
sequence-specific amplification, wherein each experimental primer
combination and the control primer combination are predicted to
produce one or more amplicons having expected sizes in
basepairs; (d) separating the amplicons by size, wherein the
presence of: a 197 bp amplicon with the control primer combination
is indicative of a CCR5 wildtype coding sequence; a 165 bp amplicon
with the control primer combination is indicative of a CCR5-Δ32
coding sequence; a 363 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 7, a 367 bp amplicon with experimental
primer combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 14 & 16
is indicative of a P*0101 CCR5 promoter allele; a 363 bp amplicon
with experimental primer combination SEQ ID Nos. 5 & 7, a 367 bp
amplicon with experimental primer combination SEQ ID Nos. 5 & 10,
a 412 bp amplicon with experimental primer combination SEQ ID Nos.
5 & 13 and a 309 bp amplicon with experimental primer combination
SEQ ID Nos. 14 & 16 is indicative of a P*0102 CCR5 promoter allele;
a 363 bp amplicon with experimental primer combination SEQ ID Nos.
5 & 7, a 363 bp amplicon with experimental primer combination SEQ
ID Nos. 5 & 9, a 367 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 11, a 412 bp amplicon with experimental
primer combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 is indicative
of a P*0103 CCR5 promoter allele; a 363 bp amplicon with
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp
amplicon with experimental primer combination SEQ ID Nos. 6 & 13
and a 309 bp amplicon with experimental primer combination SEQ ID
Nos. 14 & 16 is indicative of a P*0201 CCR5 promoter allele; a 363
bp amplicon with experimental primer combination SEQ ID Nos. 6 & 8,
a 412 bp amplicon with experimental primer combination SEQ ID Nos.
6 & 13 and a 309 bp amplicon with experimental primer combination
SEQ ID Nos. 15 & 16 is indicative of a P*0202 CCR5 promoter allele;
(e) determining a CCR genotype for each sample based upon the
CCR alleles indicated following step (d); and (f) compiling the
genotypes determined in step (e), thereby genotyping the
representative number of individuals in the population, thereby
surveying CCR genotypes in the population.
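
The allele assignment of steps (d) and (e) can be sketched in code. The following Python fragment is illustrative only and is not part of the claimed method: the names ALLELE_SIGNATURES and call_promoter_genotypes are hypothetical, and the sketch assumes that a diploid sample displays the union of the amplicons recited for its two alleles, which is consistent with the genotype patterns recited in the later embodiments.

```python
from itertools import combinations_with_replacement

# Amplicon signature of each promoter allele as recited in step (d):
# (first SEQ ID No., second SEQ ID No., expected amplicon size in bp).
ALLELE_SIGNATURES = {
    "P*0101": {(5, 7, 363), (5, 10, 367), (5, 12, 412), (14, 16, 309)},
    "P*0102": {(5, 7, 363), (5, 10, 367), (5, 13, 412), (14, 16, 309)},
    "P*0103": {(5, 7, 363), (5, 9, 363), (5, 11, 367), (5, 13, 412),
               (14, 16, 309)},
    "P*0201": {(6, 8, 363), (6, 13, 412), (14, 16, 309)},
    "P*0202": {(6, 8, 363), (6, 13, 412), (15, 16, 309)},
}


def call_promoter_genotypes(observed):
    """Return every allele pair whose combined signature equals `observed`.

    Assumption (not stated in the text): a sample shows exactly the union
    of the amplicons of its two alleles, with no extra or missing bands.
    """
    observed = set(observed)
    matches = []
    for a, b in combinations_with_replacement(sorted(ALLELE_SIGNATURES), 2):
        if ALLELE_SIGNATURES[a] | ALLELE_SIGNATURES[b] == observed:
            matches.append((a, b))
    return matches


# Example: a sample showing the amplicons of both P*0101 and P*0202.
sample = ALLELE_SIGNATURES["P*0101"] | ALLELE_SIGNATURES["P*0202"]
print(call_promoter_genotypes(sample))
```

An empty result would flag a pattern that none of the five recited alleles explains, for example a failed reaction or a possible new allele.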

In another embodiment of the present invention, there
is provided a method of surveying HIV-1 co-receptor CCR alleles in
an individual, comprising the steps of: (a) obtaining a biological
sample from an individual, wherein the sample comprises genomic
DNA; (b) combining a portion of the sample with at least one
experimental primer combination and a control primer
combination, thereby producing primer-annealed DNA, wherein
the experimental primer combinations are selected from the group
consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10,
5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and
15 & 16, wherein the control primer combination is SEQ ID Nos. 17
& 18; (c) amplifying the primer-annealed DNA in a reaction,
thereby producing amplicons, wherein reaction conditions for the
amplification are optimized for sequence-specific amplification,
wherein each experimental primer combination and the control
primer combination are predicted to produce one or more
amplicons having expected sizes in basepairs; and (d) separating
the amplicons by size, wherein the
presence of: a 197 bp amplicon with the control primer combination
is indicative of a CCR5 wildtype coding sequence; a 165 bp amplicon
with the control primer combination is indicative of a CCR5-Δ32
coding sequence; a 363 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 7, a 367 bp amplicon with experimental
primer combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 14 & 16
is indicative of a P*0101 CCR5 promoter allele; a 363 bp amplicon
with experimental primer combination SEQ ID Nos. 5 & 7, a 367 bp
amplicon with experimental primer combination SEQ ID Nos. 5 & 10,
a 412 bp amplicon with experimental primer combination SEQ ID Nos.
5 & 13 and a 309 bp amplicon with experimental primer combination
SEQ ID Nos. 14 & 16 is indicative of a P*0102 CCR5 promoter allele;
a 363 bp amplicon with experimental primer combination SEQ ID Nos.
5 & 7, a 363 bp amplicon with experimental primer combination SEQ
ID Nos. 5 & 9, a 367 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 11, a 412 bp amplicon with experimental
primer combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 is indicative
of a P*0103 CCR5 promoter allele; a 363 bp amplicon with
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp
amplicon with experimental primer combination SEQ ID Nos. 6 & 13
and a 309 bp amplicon with experimental primer combination SEQ ID
Nos. 14 & 16 is indicative of a P*0201 CCR5 promoter allele; a 363
bp amplicon with experimental primer combination SEQ ID Nos. 6 & 8,
a 412 bp amplicon with experimental primer combination SEQ ID Nos.
6 & 13 and a 309 bp amplicon with experimental primer combination
SEQ ID Nos. 15 & 16 is indicative of a P*0202 CCR5 promoter allele.
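
The reading of the control amplicon in step (d) can likewise be sketched. This Python fragment is illustrative only: the function name call_ccr5_coding_genotype is hypothetical, and treating the presence of both the 197 bp and 165 bp products as a WT/Δ32 heterozygote is an assumption drawn from ordinary diploid genetics rather than from the recited steps.

```python
# Control amplicon sizes recited in step (d), in bp.
WILD_TYPE_BP = 197
DELTA32_BP = 165  # 197 bp minus the 32 bp deletion


def call_ccr5_coding_genotype(control_sizes):
    """Map the control-amplicon sizes seen in a sample to a coding genotype.

    Assumption: both bands together indicate a heterozygote.
    """
    sizes = set(control_sizes)
    if sizes == {WILD_TYPE_BP}:
        return "WT/WT"
    if sizes == {WILD_TYPE_BP, DELTA32_BP}:
        return "WT/Δ32"
    if sizes == {DELTA32_BP}:
        return "Δ32/Δ32"
    # No control product, or a band of unexpected size: do not call.
    return None


print(call_ccr5_coding_genotype([197, 165]))
```

Returning None rather than a genotype keeps a failed or contaminated reaction from being scored as an allele.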

In yet another embodiment of the present invention,
there is provided a method of predicting the disease progression
to AIDS in an HIV-1-infected individual, comprising the steps of:
(a) obtaining a biological sample from an individual, wherein the
sample comprises genomic DNA; (b) combining a portion of the
sample with at least one experimental primer combination and a
control primer combination, thereby producing primer-annealed
DNA, wherein the experimental primer combinations are selected
from the group consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8,
5 & 9, 5 & 10, 5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13,
14 & 16 and 15 & 16, wherein the control primer combination
is SEQ ID Nos. 17 & 18; (c) amplifying the primer-annealed DNA in
a reaction, thereby producing amplicons, wherein reaction
conditions for the amplification are optimized for
sequence-specific amplification, wherein each experimental primer
combination and the control primer combination are predicted to
produce one or more amplicons having expected sizes in
basepairs; (d) separating the amplicons by size, wherein the
presence of: a 363 bp amplicon with experimental primer
combination SEQ ID Nos. 6 & 8, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 6 & 13 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 14 &
16 is indicative of a CCR5 promoter genotype of P*0201/P*0201; a
363 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 7, a 367 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 13, a 363 bp
amplicon with experimental primer combination SEQ ID Nos. 6 &
8, a 412 bp amplicon with experimental primer combination SEQ
ID Nos. 6 & 13, a 309 bp amplicon with experimental primer
combination SEQ ID Nos. 14 & 16 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 15 & 16 is indicative
of a CCR5 promoter genotype of P*0102/P*0202; a 363 bp
amplicon with experimental primer combination SEQ ID Nos. 5 &
7, a 367 bp amplicon with experimental primer combination SEQ
ID Nos. 5 & 10, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 12, a 363 bp amplicon with
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp
amplicon with experimental primer combination SEQ ID Nos. 6 &
13 and a 309 bp amplicon with experimental primer combination
SEQ ID Nos. 14 & 16 is indicative of a CCR5 promoter genotype of
P*0101/P*0201; a 363 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 7, a 367 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 10, a 412 bp
amplicon with experimental primer combination SEQ ID Nos. 5 &
12, a 363 bp amplicon with experimental primer combination SEQ
ID Nos. 6 & 8, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 6 & 13, a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 and a 309
bp amplicon with experimental primer combination SEQ ID Nos. 15
& 16 is indicative of a CCR5 promoter genotype of P*0101/P*0202;
wherein a CCR5 promoter genotype of P*0201/P*0201 or
P*0102/P*0202 is predictive of an accelerated rate of AIDS
progression in the individual (relative to an individual who does
not possess the P*0201/P*0201 or P*0102/P*0202 genotype),
wherein a CCR5 promoter genotype of P*0101/P*0201 or
P*0101/P*0202 is predictive of a slower rate of AIDS progression
in the individual (relative to an individual who does not possess
the P*0101/P*0201 or P*0101/P*0202 genotype).

In still yet another embodiment of the present
invention, there is provided a method of predicting the probability
of HIV-1 infection in an individual, comprising the steps of: (a)
obtaining a biological sample from an individual, wherein the
sample comprises genomic DNA; (b) combining a portion of the
sample with at least one experimental primer combination and a
control primer combination, thereby producing primer-annealed
DNA, wherein the experimental primer combinations are selected
from the group consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8,
5 & 9, 5 & 10, 5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13,
14 & 16 and 15 & 16, wherein the control primer combination
is SEQ ID Nos. 17 & 18; (c) amplifying the primer-annealed DNA in
a reaction, thereby producing amplicons, wherein reaction
conditions for the amplification are optimized for
sequence-specific amplification, wherein each experimental primer
combination and the control primer combination are predicted to
produce one or more amplicons having expected sizes in
basepairs; and (d) separating the amplicons by size, wherein the
presence of: a 363 bp amplicon with experimental primer
combination SEQ ID Nos. 6 & 8, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 6 & 13 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 14 &
16 is indicative of a CCR5 promoter genotype of P*0201/P*0201; a
363 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 7, a 367 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 14 &
16 is indicative of a CCR5 promoter genotype of P*0101/P*0101;
wherein a CCR5 promoter genotype of P*0201/P*0201 is predictive
of a decreased probability of HIV-1 infection in the individual
(relative to an individual who does not possess the
P*0201/P*0201 genotype), wherein a CCR5 promoter genotype of
P*0101/P*0101 is predictive of an increased probability of HIV-1
infection in the individual (relative to an individual who does
not possess the P*0101/P*0101 genotype).
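
The genotype-to-prediction rules recited in the two preceding embodiments reduce to a pair of lookup tables. The Python fragment below is an illustrative sketch only; the names PROGRESSION, INFECTION and predict are hypothetical, and a genotype absent from a table is simply reported as None rather than assigned any prediction.

```python
# Keys are unordered allele pairs; a homozygote collapses to one element.
PROGRESSION = {
    frozenset(["P*0201"]): "accelerated",            # P*0201/P*0201
    frozenset(["P*0102", "P*0202"]): "accelerated",  # P*0102/P*0202
    frozenset(["P*0101", "P*0201"]): "slower",       # P*0101/P*0201
    frozenset(["P*0101", "P*0202"]): "slower",       # P*0101/P*0202
}

INFECTION = {
    frozenset(["P*0201"]): "decreased",  # P*0201/P*0201
    frozenset(["P*0101"]): "increased",  # P*0101/P*0101
}


def predict(allele_a, allele_b):
    """Look up the recited predictions for a CCR5 promoter genotype."""
    key = frozenset([allele_a, allele_b])
    return {
        "progression": PROGRESSION.get(key),
        "infection_probability": INFECTION.get(key),
    }


print(predict("P*0101", "P*0202"))
```

Using an unordered key means P*0202/P*0101 and P*0101/P*0202 resolve to the same entry.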

In yet another embodiment of the present invention,
there is provided a method of correlating CCR genotypes with
HIV-1 transmission and/or disease progression, comprising the steps
of: (a) obtaining biological samples from a representative number
of individuals, wherein each sample is from a different individual,
wherein the sample comprises genomic DNA; (b) assessing each
individual's HIV-1 status and/or risk of acquiring HIV-1; (c)
assigning each individual to a risk group, wherein the assignment
is based upon the individual's HIV-1 status and/or risk of
acquiring HIV-1; (d) combining a portion of each sample with at
least one experimental primer combination and a control primer
combination, thereby producing primer-annealed DNA, wherein
the experimental primer combinations are selected from the group
consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10,
5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and
15 & 16, wherein the control primer combination is SEQ ID Nos. 17
& 18; (e) amplifying the primer-annealed DNA in a reaction,
thereby producing amplicons, wherein reaction conditions for the
amplification are optimized for sequence-specific amplification,
wherein each experimental primer combination and the control
primer combination are predicted to produce one or more
amplicons having expected sizes in basepairs; and (f) separating
the amplicons by size, wherein the
presence of: a 197 bp amplicon with the control primer combination
is indicative of a CCR5 wildtype coding sequence; a 165 bp amplicon
with the control primer combination is indicative of a CCR5-Δ32
coding sequence; a 363 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 7, a 367 bp amplicon with experimental
primer combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 14 & 16
is indicative of a P*0101 CCR5 promoter allele; a 363 bp amplicon
with experimental primer combination SEQ ID Nos. 5 & 7, a 367 bp
amplicon with experimental primer combination SEQ ID Nos. 5 & 10,
a 412 bp amplicon with experimental primer combination SEQ ID Nos.
5 & 13 and a 309 bp amplicon with experimental primer combination
SEQ ID Nos. 14 & 16 is indicative of a P*0102 CCR5 promoter allele;
a 363 bp amplicon with experimental primer combination SEQ ID Nos.
5 & 7, a 363 bp amplicon with experimental primer combination SEQ
ID Nos. 5 & 9, a 367 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 11, a 412 bp amplicon with experimental
primer combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 is indicative
of a P*0103 CCR5 promoter allele; a 363 bp amplicon with
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp
amplicon with experimental primer combination SEQ ID Nos. 6 & 13
and a 309 bp amplicon with experimental primer combination SEQ ID
Nos. 14 & 16 is indicative of a P*0201 CCR5 promoter allele; a 363
bp amplicon with experimental primer combination SEQ ID Nos. 6 & 8,
a 412 bp amplicon with experimental primer combination SEQ ID Nos.
6 & 13 and a 309 bp amplicon with experimental primer combination
SEQ ID Nos. 15 & 16 is indicative of a P*0202 CCR5 promoter allele.
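
Once each individual has a risk group and a genotype, the correlation can be examined by tabulating genotypes per group. The Python sketch below is illustrative only: the group labels, the function names, and the choice of an odds ratio as the summary statistic are all assumptions, since the recited steps do not name a particular statistic.

```python
from collections import Counter


def genotype_counts_by_group(records):
    """Tabulate genotypes per risk group.

    `records` is an iterable of (risk_group, genotype) pairs,
    one pair per individual.
    """
    counts = {}
    for group, genotype in records:
        counts.setdefault(group, Counter())[genotype] += 1
    return counts


def odds_ratio(counts, genotype, group_a, group_b):
    """Odds of carrying `genotype` in group_a relative to group_b."""
    a = counts[group_a][genotype]             # carriers in group_a
    b = sum(counts[group_a].values()) - a     # non-carriers in group_a
    c = counts[group_b][genotype]             # carriers in group_b
    d = sum(counts[group_b].values()) - c     # non-carriers in group_b
    if b == 0 or c == 0:
        return None  # undefined without a continuity correction
    return (a * d) / (b * c)


# Hypothetical data: group labels and genotypes are placeholders.
records = (
    [("seropositive", "P*0101/P*0101")] * 3
    + [("seropositive", "P*0201/P*0201")]
    + [("seronegative", "P*0101/P*0101")]
    + [("seronegative", "P*0201/P*0201")] * 3
)
table = genotype_counts_by_group(records)
print(odds_ratio(table, "P*0101/P*0101", "seropositive", "seronegative"))
```

In practice a significance test and adjustment for ethnicity would accompany any such ratio; those are omitted here for brevity.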

In a further embodiment of the present invention,
there is provided an oligonucleotide selected from the group
consisting of SEQ ID Nos. 2-16. Other aspects, features, and
advantages of the present invention will be apparent from the
following description of the presently preferred embodiments of
the invention. These embodiments are given for the purpose of
disclosure.



BRIEF DESCRIPTION OF THE DRAWINGS 



The appended drawings have been included herein so
that the above-recited features, advantages and objects of the
invention will become clear and can be understood in detail.
These drawings form a part of the specification. The appended
drawings illustrate preferred embodiments of the invention and
should not be considered to limit the scope of the invention.

Figure 1 shows a genetic map of CCR variants on
chromosome 3 and the genotyping strategy of the present
invention. Figure 1A shows that polymorphisms in the coding
regions of CCR2b and CCR5, known as 64V/I and CCR5-Δ32,
respectively, occur at nucleotide positions 46295 (based upon
GenBank sequence U95626) and from 62036 to 62067, respectively.
Nucleotides 59052G and 59530C enclose the region associated
with basal and induced CCR5 promoter activities. Figure 1B
shows polymorphisms within and slightly beyond the boundaries
of the CCR5 promoter. These polymorphisms define 6 allelic
variants in these cohorts (P*0104 was only observed once in the 69
sequenced samples). Single polymorphic positions (59029 and
59653) and segments (boxed) in this region have been targeted in
various studies and the corresponding alternative allele
nomenclature is also shown (boxed). Figure 1C shows that, based
upon the various features outlined above, genotyping can be
performed utilizing sequence-specific primers (PCR-SSP-based
typing) that simultaneously target all major variants in 15 different
PCR reactions. The polarity (5'→3') and the 3' terminal sequence
of each sequence-specific and general primer used in PCR-SSP are
indicated by solid and open arrows, respectively. Three SSP
reactions (2b, 2c and 3c, as indicated by broken lines) do not
contribute to allele assignment but have the potential to detect
new alleles involving the targeted polymorphic sites. The control
product in all reactions carries the 32 bp deleted region in the
CCR5 coding sequence.

Figure 2 shows examples of PCR-SSP-based typing of
CCR5 variants. Four samples (No. 1 to No. 4) are shown with 12
selected SSP reactions (1a, 1b, 2a, 2d, 3a, 3b, 3d, 4a, 4b, 2e, 5a, 5b
(from left to right)). Arrows indicate the position of the control
product that is present in all reactions. DNA size markers (M)
(New England Biolabs) are also included. Sample No. 1 has the
genotype CCR5 wild type WT/WT, CCR2b-64V/I and CCR5 promoter
P*0101/P*0202; Sample No. 2 has the genotype CCR5 WT/WT,
CCR2b-64V/I and CCR5 promoter P*0102/P*0202; Sample No. 3 has
the genotype CCR5 WT/WT, CCR2b-64V/I and CCR5 promoter
P*0103/P*0202; Sample No. 4 has the genotype CCR5 WT/WT,
CCR2b-64V/I and CCR5 promoter P*0201/P*0202.

Figure 3 shows the allelic frequencies of CCR2b-64I,
CCR5-Δ32, and CCR5 promoter variants in four ethnic groups from
the Bronx, New York (Caucasians, African-Americans, and
Hispanics) and Kigali, Rwanda. The CCR2b-64I is invariably linked
to CCR5 promoter allele P*0202 in all 4 groups, while the
Caucasian- and Hispanic-specific CCR5-Δ32 is exclusively linked to
CCR5 promoter allele P*0201 (see Tables 5 & 6). The frequency of
CCR5-Δ32 in the Caucasian-Americans studied herein was lower than
that reported elsewhere.
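
Allelic frequencies of the kind plotted in Figure 3 are conventionally obtained by direct gene counting. The following Python sketch is illustrative and assumes that method; the function name allele_frequencies and the example genotypes are hypothetical and are not data from the cohorts.

```python
from collections import Counter


def allele_frequencies(genotypes):
    """Estimate allele frequencies by gene counting.

    `genotypes` is an iterable of (allele, allele) pairs, one pair per
    individual, so each individual contributes two chromosomes.
    """
    counts = Counter()
    for allele_a, allele_b in genotypes:
        counts[allele_a] += 1
        counts[allele_b] += 1
    total = sum(counts.values())
    return {allele: n / total for allele, n in counts.items()}


# Hypothetical genotypes for two individuals (four chromosomes).
example = [("P*0101", "P*0202"), ("P*0202", "P*0202")]
print(allele_frequencies(example))
```

Computing the frequencies separately for each ethnic group, rather than on pooled samples, preserves the between-group differences that the figure reports.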

Figure 4 shows the distribution of SDF1 and CCR
genotypes in participants from Kigali, Rwanda and the Bronx, New
York. Genotypes that appeared to mediate HIV-1 transmission in
ethnic Africans (African-Americans and Rwandans) are indicated
by arrows. Genotypes that appeared to mediate HIV-1
transmission in ethnic Caucasians (Caucasian-Americans and
Hispanics) are indicated by asterisks. Genotypes showing
associations with HIV-1 transmission in all ethnic groups are
indicated by an arrow and an asterisk.

Figure 5 shows the phylogenetic relationships among
CCR5 promoter sequences from humans (1-5; and see Figure 6),
chimpanzees (6), pig-tailed macaques (7), and sooty mangabeys
(8). Figure 5A shows that CCR5 promoter sequences from non-human
primate species demonstrate nucleotide substitutions in 2 putative
transcription-factor-binding (TFB) elements: SIE-like
(sis/platelet-derived growth factor-inducible element-like) and
TATA-like (TATA box-like AT-rich sequence). Figure 5B shows the
pair-wise genetic distance matrix calculated using Kimura's
two-parameter method (31). The alleles from non-human primates
(GenBank accession numbers AF109384, AF115963, AF115964) are
identical to human allele P*0102 at all 5 positions found to be
polymorphic in humans. Figure 5C shows a phenogram based on the
maximum likelihood method. Numbers next to branches indicate the
frequencies supported by bootstrap re-analyses of the sequence
data using both the neighbor-joining method (32) in the PHYLIP
package (29) and the parsimony method in the PAUP package (30).
Grouping of human promoter allele P*0102 with P*0103 is not
supported by neighbor-joining or parsimony analysis.
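
As an illustration of the pair-wise distance named for Figure 5B, Kimura's two-parameter distance can be computed from the proportion of transitions (P) and transversions (Q) between two aligned sequences as d = -(1/2) ln[(1 - 2P - Q) sqrt(1 - 2Q)]. The following is a minimal Python sketch only; the function name is hypothetical, the input is assumed to be gap-free and limited to A, C, G and T, and no sequence data from the present study are reproduced.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def kimura_2p_distance(seq1: str, seq2: str) -> float:
    """Kimura two-parameter distance between two aligned, gap-free sequences."""
    if len(seq1) != len(seq2) or not seq1:
        raise ValueError("sequences must be aligned, non-empty and of equal length")
    transitions = 0
    transversions = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1    # purine<->purine or pyrimidine<->pyrimidine
        else:
            transversions += 1  # purine<->pyrimidine
    p = transitions / len(seq1)
    q = transversions / len(seq1)
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))
```

Identical sequences give a distance of zero, and the distance grows faster than the raw proportion of differences because the formula corrects for multiple substitutions at the same site.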

Figure 6 shows the major CCR5 promoter alleles from
humans (1-5) and their reported and predicted relationship to
HIV-1 disease progression. Figure 6A shows that the 5 major CCR5
promoter alleles (deposited in GenBank with accession numbers
AF109379, AF109380, AF109381, AF109382, AF109383,
respectively) found in various human ethnic groups are defined by
5 polymorphic sites in the region spanning nucleotide positions
59029 to 59653 (relative to GenBank accession number U59626).
Figure 6B shows the relationship between previously recognized
CCR2b and CCR5 promoter genotypes and the CCR5 promoter alleles
specified in (6A). The + and - indicate rapid and slow HIV-1
disease progression, respectively, as reported in several studies
(4-6). The predicted effect (+, rapid progression) of the
P*0201/*0201 genotype can be inferred from the reported studies.

Figure 7 shows polymorphic sites at the CCR2 and
CCR5 loci. Nomenclature for the CCR5 upstream (Pu) and
downstream (Pd) sequence differs widely among studies. The
alternative numbering of nucleotides is given in parentheses.

Figure 8 shows PCR haplotyping of CCR5 upstream
promoter variants using sequence-specific primers.

Figure 9 shows nucleotides at polymorphic sites of
stable CCR (CCR2-CCR5) haplotypes.

Figure 10 shows HIV-1 viral load (VL) in people with
different genotypes. Figure 10A shows VL according to Pu 58755
(29) A/G and the linked CCR5-Δ32. Figure 10B shows VL
according to CCR2-64V/I. V1, V2 and V3 were three different
visits.

Figure 11 shows HIV-1 viral load according to CCR
haplotypes.

Figure 12 shows the CCR genotypes exhibiting the
most contrasting associations with viral load.

Figure 13 shows the relationship between CCR5
genotypes and disease progression.

DETAILED DESCRIPTION OF THE INVENTION 

Variability in the natural history of HIV-1 infection has
been associated with polymorphic genetic variants in the
beta-chemokine receptor 5 (CCR5) locus. For instance, genetic
variations in the HIV-1 co-receptor, CCR5, including a 32-bp
deletion and a point mutation, have been shown to confer
resistance to HIV-1 infection by depleting CCR5 expression on the
cell surface. The studies reported herein reveal five major CCR5
promoter alleles with distributions that differed widely among the
four distinct ethnic groups examined. Herein, evidence is
presented that genetic variants of the CCR5 promoter also appear
to determine the infectability of heterosexual women in Kigali,
Rwanda, and injecting drug users in New York, USA. Additionally,
the relationship of the major CCR5 promoter genotypes with HIV-1
disease progression has been examined among 201 HIV-1-infected
Rwandan women. The effects of disease progression-related
CCR2b and CCR5 polymorphisms on early HIV-1 viral load were
also determined in a cohort of homosexual HIV-1 seroconverters.

In particular, promoter allele P*0103
(G59029-T59353-T59356-A59402-C59653, SEQ ID No. 21) was largely
restricted to blacks. The promoter allele P*0202
(A59029-C59353-C59356-A59402-T59653, SEQ ID No. 23) was tightly
linked to the slightly less frequent CCR2b-64I, a variant of the
CCR2b gene which is about 12.7 kb upstream of the promoter region.
Another closely related promoter allele, P*0201
(A59029-C59353-C59356-A59402-C59653, SEQ ID No. 22), exclusively
carried the far less common CCR5-Δ32, a 32-bp deletion in the CCR5
coding sequence 2 kb downstream from the promoter. Among
human, chimpanzee, pig-tailed macaque, and sooty mangabey
promoter allelic sequences, the apparent ancestral lineage of the
promoter sequence (G59029-T59353-C59356-A59402-C59653 = human
P*0102, SEQ ID No. 20) was highly conserved across the primate
species analyzed here, while P*0201 and P*0202 arose more
recently than the other 3 major alleles.

Among the various CCR5 promoter genotypes
comprising the five major alleles (P*0101, P*0102, P*0103, P*0201,
and P*0202), the genotypes involving allele P*0201 were associated
with protection against HIV-1 infection (OR=0.4, p<0.001 in Kigali;
OR=0.5, p=0.012 in New York), while the homozygous genotype
P*0101/*0101 showed the opposite effect (OR=4.7, p=0.140 in
Kigali; OR=4.1, p=0.002 in New York). According to multivariate
logistic regression models, these observed promoter effects were
independent of age, ethnicity and at least seven high-risk
behavioral factors known to influence HIV-1 infection in the two
cities: in Kigali, HIV-1 infection = 6.21 + 0.68 x (risk score) - 0.13 x
age - 0.79 x (CCR5 promoter genotypes involving P*0201); in New
York, HIV-1 infection = 1.60 + 0.31 x (needle sharing) - 0.06 x age
+ 1.41 x (CCR5 promoter genotype P*0101/*0101) - 0.83 x (being
Caucasian). Differences in the adjusted genetic effects between
the two cohorts reflected differences in the distribution of the two
mutually exclusive promoter alleles involved.
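
The two fitted models can be read as linear predictors of a logistic regression. The sketch below assumes (this is an assumption, not stated explicitly in the text) that each expression gives the log-odds of HIV-1 infection, so that a coefficient b corresponds to an adjusted odds ratio of exp(b) and the predicted probability is 1/(1 + exp(-logit)). All function and argument names are hypothetical, as is the 0/1 coding of the genotype and ethnicity terms.

```python
import math


def kigali_logit(risk_score: float, age: float, has_p0201: int) -> float:
    """Linear predictor reported for the Kigali cohort (assumed to be log-odds)."""
    return 6.21 + 0.68 * risk_score - 0.13 * age - 0.79 * has_p0201


def new_york_logit(needle_sharing: float, age: float,
                   is_p0101_homozygote: int, is_caucasian: int) -> float:
    """Linear predictor reported for the New York cohort (assumed to be log-odds)."""
    return (1.60 + 0.31 * needle_sharing - 0.06 * age
            + 1.41 * is_p0101_homozygote - 0.83 * is_caucasian)


def probability(logit: float) -> float:
    """Convert log-odds to a probability with the logistic function."""
    return 1.0 / (1.0 + math.exp(-logit))


def adjusted_odds_ratio(coefficient: float) -> float:
    """Odds ratio implied by a single logistic regression coefficient."""
    return math.exp(coefficient)
```

Under this reading, the New York coefficient of 1.41 for P*0101/*0101 corresponds to an adjusted odds ratio of about 4.1, in line with the OR=4.1 reported for that cohort.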

Two promoter genotypes (designated P*0201/*0201
and P*0102/*0202) were weakly associated with accelerated HIV-1
disease progression (p=0.072 and 0.058, respectively). In
contrast, 2 promoter genotypes involving a common allele P*0101
(P*0101/*0201 and P*0101/*0202) were collectively associated
with slower disease progression (p=0.001, pc=0.015). Both
P*0102/*0202 and P*0101/*0202 genotypes always carried
CCR2b-64I, but CCR2b-64I alone had no detectable effect on disease
progression. These findings support and refine some of the earlier
observations regarding CCR5 promoter polymorphisms and further
suggest that the previously recognized CCR2b-64I relationship is
secondary to the CCR5 promoter allelic effects. Thus, the present
invention establishes an independent linkage between HIV-1
transmission and the genetic variation within the CCR5 promoter
alleles and further establishes a mechanism by which CCR
chemokine receptor polymorphisms govern the initiation and
pathogenesis of HIV-1 infection.

The present invention is directed towards methods of 
correlating CCR alleles and/or genotypes, specifically CCR5 
promoter alleles, with HIV-1 transmission and/or disease 
progression in an individual or population.

The present invention is directed towards a method of
surveying CCR genotypes in a population, comprising the steps of:
(a) obtaining biological samples from a representative number of
individuals in a population, wherein each sample is from a
different individual, wherein the sample comprises genomic DNA;
(b) combining a portion of each sample with at least one
experimental primer combination and a control primer
combination, thereby producing primer-annealed DNA, wherein
the experimental primer combinations are selected from the group
consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10,
5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and
15 & 16, wherein the control primer combination is SEQ ID Nos. 17
& 18; (c) amplifying the primer-annealed DNA in a reaction,
thereby producing amplicons, wherein reaction conditions for the
amplification are optimized for sequence-specific amplification,
wherein each experimental primer combination and the control
primer combination are predicted to produce one or more
amplicons having expected sizes in basepairs; (d) separating the
amplicons by size, wherein the presence of: a 197 bp amplicon
with the control primer combination is indicative of a CCR5
wildtype coding sequence; a 165 bp amplicon with the control
primer combination is indicative of a CCR5-Δ32 coding sequence; a
363 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 7, a 367 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 14 &
16 is indicative of a P*0101 CCR5 promoter allele; a 363 bp
amplicon with experimental primer combination SEQ ID Nos. 5 &
7, a 367 bp amplicon with experimental primer combination SEQ
ID Nos. 5 & 10, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 is indicative
of a P*0102 CCR5 promoter allele; a 363 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 7, a 363 bp
amplicon with experimental primer combination SEQ ID Nos. 5 &
9, a 367 bp amplicon with experimental primer combination SEQ
ID Nos. 5 & 11, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 is indicative
of a P*0103 CCR5 promoter allele; a 363 bp amplicon with
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp
amplicon with experimental primer combination SEQ ID Nos. 6 &
13 and a 309 bp amplicon with experimental primer combination
SEQ ID Nos. 14 & 16 is indicative of a P*0201 CCR5 promoter allele;
a 363 bp amplicon with experimental primer combination SEQ ID
Nos. 6 & 8, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 15 & 16 is indicative
of a P*0202 CCR5 promoter allele; (e) determining a CCR genotype
for each sample based upon the CCR alleles indicated following
step (d); and (f) compiling the genotypes determined in step (e),
thereby genotyping the representative number of individuals in the
population, thereby surveying CCR genotypes in the population.
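
Steps (d) and (e) amount to matching the observed amplicons against the expected pattern of each promoter allele. The following Python sketch is one possible way to express that lookup, not the procedure used in the studies: the table is transcribed from the amplicon sizes listed above, while the names `ALLELE_SIGNATURES` and `call_genotypes` are hypothetical. It assumes that every listed primer combination was run on the sample, that bands are scored only as present or absent at the expected size, and that only the five major alleles occur.

```python
from itertools import combinations_with_replacement

# Expected amplicons per promoter allele, written as
# (first SEQ ID No., second SEQ ID No., amplicon size in bp).
ALLELE_SIGNATURES = {
    "P*0101": frozenset({(5, 7, 363), (5, 10, 367), (5, 12, 412), (14, 16, 309)}),
    "P*0102": frozenset({(5, 7, 363), (5, 10, 367), (5, 13, 412), (14, 16, 309)}),
    "P*0103": frozenset({(5, 7, 363), (5, 9, 363), (5, 11, 367),
                         (5, 13, 412), (14, 16, 309)}),
    "P*0201": frozenset({(6, 8, 363), (6, 13, 412), (14, 16, 309)}),
    "P*0202": frozenset({(6, 8, 363), (6, 13, 412), (15, 16, 309)}),
}


def call_genotypes(observed):
    """Return every allele pair whose combined expected amplicons equal `observed`."""
    observed = frozenset(observed)
    matches = []
    for a, b in combinations_with_replacement(sorted(ALLELE_SIGNATURES), 2):
        # A genotype is expected to show the union of its two alleles' amplicons.
        if ALLELE_SIGNATURES[a] | ALLELE_SIGNATURES[b] == observed:
            matches.append((a, b))
    return matches
```

An empty result means the observed pattern fits none of the fifteen possible genotypes, which in practice would prompt a repeat of the reactions or sequencing of the sample.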

The present invention is additionally directed towards a
method of surveying HIV-1 co-receptor CCR alleles in an
individual, comprising the steps of: (a) obtaining a biological
sample from an individual, wherein the sample comprises genomic
DNA; (b) combining a portion of the sample with at least one
experimental primer combination and a control primer
combination, thereby producing primer-annealed DNA, wherein
the experimental primer combinations are selected from the group
consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10,
5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and
15 & 16, wherein the control primer combination is SEQ ID Nos. 17
& 18; (c) amplifying the primer-annealed DNA in a reaction,
thereby producing amplicons, wherein reaction conditions for the
amplification are optimized for sequence-specific amplification,
wherein each experimental primer combination and the control
primer combination are predicted to produce one or more
amplicons having expected sizes in basepairs; and (d) separating
the amplicons by size, wherein the presence of: a 197 bp amplicon
with the control primer combination is indicative of a CCR5
wildtype coding sequence; a 165 bp amplicon with the control

WO 01/12857 PCT/US00/22255

primer combination is indicative of a CCR5-Δ32 coding sequence; a
363 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 7, a 367 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 14 &
16 is indicative of a P*0101 CCR5 promoter allele; a 363 bp
amplicon with experimental primer combination SEQ ID Nos. 5 &
7, a 367 bp amplicon with experimental primer combination SEQ
ID Nos. 5 & 10, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 is indicative
of a P*0102 CCR5 promoter allele; a 363 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 7, a 363 bp
amplicon with experimental primer combination SEQ ID Nos. 5 &
9, a 367 bp amplicon with experimental primer combination SEQ
ID Nos. 5 & 11, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 is indicative
of a P*0103 CCR5 promoter allele; a 363 bp amplicon with
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp
amplicon with experimental primer combination SEQ ID Nos. 6 &
13 and a 309 bp amplicon with experimental primer combination
SEQ ID Nos. 14 & 16 is indicative of a P*0201 CCR5 promoter allele;
a 363 bp amplicon with experimental primer combination SEQ ID
Nos. 6 & 8, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 15 & 16 is indicative
of a P*0202 CCR5 promoter allele.
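
The control reaction (SEQ ID Nos. 17 & 18) distinguishes the wildtype coding sequence (197 bp) from the Δ32 deletion (165 bp, i.e., 197 - 32). A minimal Python sketch of how the control bands of one sample might be interpreted is given below. The function name and the returned labels are hypothetical, and inferring zygosity from the band pattern is an assumption of this sketch; the text above states only what each band indicates.

```python
def ccr5_coding_genotype(control_band_sizes):
    """Interpret the control amplicon sizes (in bp) observed for one sample."""
    bands = set(control_band_sizes)
    unexpected = bands - {197, 165}
    if unexpected:
        raise ValueError(f"unexpected control band size(s): {sorted(unexpected)}")
    if bands == {197}:
        return "wildtype/wildtype"
    if bands == {197, 165}:
        return "wildtype/delta32"
    if bands == {165}:
        return "delta32/delta32"
    return "no call"  # no control amplicon at all: the reaction may have failed
```

Treating an absent control band as "no call" rather than as a genotype keeps a failed amplification from being mistaken for a negative result in the experimental reactions.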

The present invention is further directed towards a
method of predicting the disease progression to AIDS in an
HIV-1-infected individual, comprising the steps of: (a) obtaining a
biological sample from an individual, wherein the sample
comprises genomic DNA; (b) combining a portion of the sample
with at least one experimental primer combination and a control
primer combination, thereby producing primer-annealed DNA,
wherein the experimental primer combinations are selected from
the group consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 &
9, 5 & 10, 5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14
& 16 and 15 & 16, wherein the control primer combination is SEQ
ID Nos. 17 & 18; (c) amplifying the primer-annealed DNA in a
reaction, thereby producing amplicons, wherein reaction
conditions for the amplification are optimized for
sequence-specific amplification, wherein each experimental primer
combination and the control primer combination are predicted to
produce one or more amplicons having expected sizes in
basepairs; (d) separating the amplicons by size, wherein the
presence of: a 363 bp amplicon with experimental primer
combination SEQ ID Nos. 6 & 8, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 6 & 13 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 14 &
16 is indicative of a CCR5 promoter genotype of P*0201/P*0201; a
363 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 7, a 367 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 13, a 363 bp
amplicon with experimental primer combination SEQ ID Nos. 6 &
8, a 412 bp amplicon with experimental primer combination SEQ
ID Nos. 6 & 13, a 309 bp amplicon with experimental primer
combination SEQ ID Nos. 14 & 16 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 15 & 16 is indicative
of a CCR5 promoter genotype of P*0102/P*0202; a 363 bp
amplicon with experimental primer combination SEQ ID Nos. 5 &
7, a 367 bp amplicon with experimental primer combination SEQ
ID Nos. 5 & 10, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 12, a 363 bp amplicon with
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp
amplicon with experimental primer combination SEQ ID Nos. 6 &
13 and a 309 bp amplicon with experimental primer combination
SEQ ID Nos. 14 & 16 is indicative of a CCR5 promoter genotype of
P*0101/P*0201; a 363 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 7, a 367 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 10, a 412 bp
amplicon with experimental primer combination SEQ ID Nos. 5 &
12, a 363 bp amplicon with experimental primer combination SEQ
ID Nos. 6 & 8, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 6 & 13, a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 and a 309
bp amplicon with experimental primer combination SEQ ID Nos. 15
& 16 is indicative of a CCR5 promoter genotype of P*0101/P*0202;
wherein a CCR5 promoter genotype of P*0201/P*0201 or
P*0102/P*0202 is predictive of an accelerated rate of AIDS
progression in the individual (relative to an individual who does
not possess the P*0201/P*0201 or P*0102/P*0202 genotype),
wherein a CCR5 promoter genotype of P*0101/P*0201 or
P*0101/P*0202 is predictive of a slower rate of AIDS progression
in the individual (relative to an individual who does not possess
the P*0101/P*0201 or P*0101/P*0202 genotype).
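
Once a promoter genotype has been assigned, the predictive step reduces to a lookup. The Python sketch below illustrates this under the genotype-to-outcome assignments stated above; the names `ACCELERATED`, `SLOWER` and `predicted_progression` are hypothetical, and genotypes not named in the method are reported as carrying no prediction.

```python
# Genotypes are stored as unordered sets, so a homozygote is a one-element set.
ACCELERATED = {frozenset({"P*0201"}), frozenset({"P*0102", "P*0202"})}
SLOWER = {frozenset({"P*0101", "P*0201"}), frozenset({"P*0101", "P*0202"})}


def predicted_progression(allele_1: str, allele_2: str) -> str:
    """Relative rate of AIDS progression predicted for a CCR5 promoter genotype."""
    genotype = frozenset({allele_1, allele_2})  # allele order is irrelevant
    if genotype in ACCELERATED:
        return "accelerated"
    if genotype in SLOWER:
        return "slower"
    return "no prediction"
```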

The present invention is also directed towards a
method of predicting the probability of HIV-1 infection in an
individual, comprising the steps of: (a) obtaining a biological
sample from an individual, wherein the sample comprises genomic
DNA; (b) combining a portion of the sample with at least one
experimental primer combination and a control primer
combination, thereby producing primer-annealed DNA, wherein
the experimental primer combinations are selected from the group
consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10,
5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and
15 & 16, wherein the control primer combination is SEQ ID Nos. 17
& 18; (c) amplifying the primer-annealed DNA in a reaction,
thereby producing amplicons, wherein reaction conditions for the
amplification are optimized for sequence-specific amplification,
wherein each experimental primer combination and the control
primer combination are predicted to produce one or more
amplicons having expected sizes in basepairs; and (d) separating
the amplicons by size, wherein the presence of: a 363 bp amplicon
with experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp
amplicon with experimental primer combination SEQ ID Nos. 6 & 13
and a 309 bp amplicon with experimental primer combination SEQ
ID Nos. 14 & 16 is indicative of a CCR5 promoter genotype of
P*0201/P*0201; a 363 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 7, a 367 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 10, a 412 bp
amplicon with experimental primer combination SEQ ID Nos. 5 & 12
and a 309 bp amplicon with experimental primer combination SEQ
ID Nos. 14 & 16 is indicative of a CCR5 promoter genotype of
P*0101/P*0101; wherein a CCR5 promoter genotype of
P*0201/P*0201 is predictive of a decreased probability of HIV-1
infection in the individual (relative to an individual who does not
possess the P*0201/P*0201 genotype), wherein a CCR5 promoter
genotype of P*0101/P*0101 is predictive of an increased
probability of HIV-1 infection in the individual (relative to an
individual who does not possess the P*0101/P*0101 genotype).

The present invention is yet further directed towards a
method of correlating CCR genotypes with HIV-1 transmission
and/or disease progression, comprising the steps of: (a) obtaining
biological samples from a representative number of individuals,
wherein each sample is from a different individual, wherein the
sample comprises genomic DNA; (b) assessing each individual's
HIV-1 status and/or risk of acquiring HIV-1; (c) assigning each
individual to a risk group, wherein the assignment is based upon
the individual's HIV-1 status and/or risk of acquiring HIV-1; (d)
combining a portion of each sample with at least one experimental
primer combination and a control primer combination, thereby
producing primer-annealed DNA, wherein the experimental primer
combinations are selected from the group consisting of SEQ ID
Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10, 5 & 11, 5 & 12, 5 &
13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and 15 & 16, wherein the
control primer combination is SEQ ID Nos. 17 & 18; (e) amplifying
the primer-annealed DNA in a reaction, thereby producing
amplicons, wherein reaction conditions for the amplification are
optimized for sequence-specific amplification, wherein each
experimental primer combination and the control primer
combination are predicted to produce one or more amplicons
having expected sizes in basepairs; and (f) separating the
amplicons by size, wherein the presence of: a 197 bp amplicon
with the control primer combination is indicative of a CCR5
wildtype coding sequence; a 165 bp amplicon with the control
primer combination is indicative of a CCR5-Δ32 coding sequence; a
363 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 7, a 367 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 14 &
16 is indicative of a P*0101 CCR5 promoter allele; a 363 bp
amplicon with experimental primer combination SEQ ID Nos. 5 &
7, a 367 bp amplicon with experimental primer combination SEQ
ID Nos. 5 & 10, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 is indicative
of a P*0102 CCR5 promoter allele; a 363 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 7, a 363 bp
amplicon with experimental primer combination SEQ ID Nos. 5 &
9, a 367 bp amplicon with experimental primer combination SEQ
ID Nos. 5 & 11, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 is indicative
of a P*0103 CCR5 promoter allele; a 363 bp amplicon with
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp
amplicon with experimental primer combination SEQ ID Nos. 6 &
13 and a 309 bp amplicon with experimental primer combination
SEQ ID Nos. 14 & 16 is indicative of a P*0201 CCR5 promoter allele;
a 363 bp amplicon with experimental primer combination SEQ ID
Nos. 6 & 8, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 15 & 16 is indicative
of a P*0202 CCR5 promoter allele; (g) determining a CCR genotype
for each sample based upon the CCR alleles indicated following
step (f); and (h) analyzing the genotypes determined in step (g),
thereby genotyping the representative number of individuals with
respect to the risk group assigned each individual, thereby
correlating CCR genotypes with HIV-1 transmission and/or disease
progression. Representative biological samples include blood,
serum, saliva, semen, tissue biopsy and isolated DNA. Generally,
separation is by gel electrophoresis, but may also be by means
such as size fractionation. A preferred method of amplification
includes polymerase chain reaction (PCR). Typical means of
analysis include two-tailed Fisher's exact test, multiple logistic
regression analysis, univariate analysis and multivariate analyses.
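
For a 2 x 2 table of genotype (present or absent) against risk group, the two-tailed Fisher's exact test named above can be computed directly from the hypergeometric distribution. The Python sketch below is a minimal standard-library version offered for illustration; the function name is hypothetical, the two-tailed p-value is taken as the sum of the probabilities of all tables no more likely than the observed one (one common convention), and no counts from the present study are used.

```python
from math import comb


def fisher_exact_two_tailed(a: int, b: int, c: int, d: int) -> float:
    """Two-tailed Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def table_probability(x: int) -> float:
        # Hypergeometric probability of x in the top-left cell with fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_observed = table_probability(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum the probabilities of all tables that are no more likely than the
    # observed one; the small tolerance guards against rounding at ties.
    p_value = sum(
        p for p in (table_probability(x) for x in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-9)
    )
    return min(1.0, p_value)
```

Here `a` and `b` would be, for example, the numbers of seropositive and seronegative subjects carrying a genotype, and `c` and `d` the corresponding numbers of subjects lacking it.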

The present invention is additionally directed towards 
the oligonucleotides shown in SEQ ID Nos. 2-16. 

In accordance with the present invention, there may be
employed conventional molecular biology, microbiology, and
recombinant DNA techniques within the skill of the art. Such
techniques are explained fully in the literature. See, e.g.,
Sambrook, Fritsch & Maniatis, "Molecular Cloning: A Laboratory
Manual (2nd Ed.)", (1989); "DNA Cloning: A Practical Approach,"
Volumes I and II (D.N. Glover ed. 1985); "Oligonucleotide
Synthesis" (M.J. Gait ed. 1984); "Nucleic Acid Hybridization" [B.D.
Hames & S.J. Higgins eds. (1985)]; "Transcription and Translation"
[B.D. Hames & S.J. Higgins eds. (1984)]; "Animal Cell Culture" [R.I.
Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" [IRL Press,
(1986)]; B. Perbal, "A Practical Guide To Molecular Cloning"
(1984). Therefore, if appearing herein, the following terms shall
have the definitions set out below.

A "DNA molecule" refers to the polymeric form of
deoxyribonucleotides (adenine, guanine, thymine, or cytosine) in
either its single-stranded form or a double-stranded helix. This
term refers only to the primary and secondary structure of the
molecule, and does not limit it to any particular tertiary forms.
Thus, this term includes double-stranded DNA found, inter alia, in
linear DNA molecules (e.g., restriction fragments), viruses,
plasmids, and chromosomes. In discussing the structure herein,
the normal convention of giving only the sequence in the 5' to 3'
direction along the nontranscribed strand of DNA is used (i.e., the
strand having a sequence homologous to the mRNA).

A DNA "coding sequence" is a double-stranded DNA
sequence which is transcribed and translated into a polypeptide in
vivo when placed under the control of appropriate regulatory
sequences. The boundaries of the coding sequence are determined
by a start codon at the 5' (amino) terminus and a translation stop
codon at the 3' (carboxyl) terminus. A coding sequence can
include, but is not limited to, prokaryotic sequences, cDNA from
eukaryotic mRNA, genomic DNA sequences from eukaryotic (e.g.,
mammalian) DNA, and even synthetic DNA sequences. A
polyadenylation signal and transcription termination sequence will
usually be located 3' to the coding sequence. Transcriptional and
translational control sequences are DNA regulatory sequences,
such as promoters, enhancers, polyadenylation signals,
terminators, and the like, that provide for the expression of a
coding sequence in a host cell. A "cDNA" is defined as copy-DNA
or complementary-DNA, and is a product of a reverse
transcription reaction from an mRNA transcript.

A "promoter sequence" is a DNA regulatory region 
capable of binding RNA polymerase in a cell and initiating 
transcription of a downstream (3' direction) coding sequence. 
Within the promoter sequence will be found a transcription 
initiation site, as well as protein binding domains (consensus
sequences) responsible for the binding of RNA polymerase. 

As used herein, the terms "restriction endonucleases" 
and "restriction enzymes" refer to enzymes which cut double- 
stranded DNA at or near a specific nucleotide sequence. 

The term "oligonucleotide" is defined as a molecule 
comprised of two or more deoxyribonucleotides, preferably more 
than three. Its exact size will depend upon many factors which, in 
turn, depend upon the ultimate function and use of the 
oligonucleotide. The term "primer" as used herein refers to an 
oligonucleotide, whether occurring naturally as in a purified 
restriction digest or produced synthetically, which is capable of 
acting as a point of initiation of synthesis when placed under 
conditions in which synthesis of a primer extension product, which 
is complementary to a nucleic acid strand, is induced, i.e., in the
presence of nucleotides and an inducing agent such as a DNA
polymerase and at a suitable temperature and pH. The primer may
be either single-stranded or double-stranded and must be
sufficiently long to prime the synthesis of the desired extension
product in the presence of the inducing agent. The exact length of
the primer will depend upon many factors, including temperature,
source of primer and the method used. For example, for
diagnostic applications, depending on the complexity of the target
sequence, the oligonucleotide primer typically contains 15-25 or
more nucleotides to maintain specificity, although it may contain
fewer nucleotides.

Primers are selected to be "substantially"
complementary to different strands of a particular target DNA
sequence. This means that the primers must be sufficiently
complementary to hybridize with their respective strands.
Therefore, the primer sequence need not reflect the exact
sequence of the template. For example, a non-complementary
nucleotide fragment may be attached to the 5' end of the primer,
with the remainder of the primer sequence being complementary
to the strand. Alternatively, non-complementary bases or longer
sequences can be interspersed into the primer, provided that the
primer sequence has sufficient complementarity with the sequence
to hybridize therewith and thereby form the template for the
synthesis of the extension product.

As used herein, the term "allele" is defined as the
combination of polymorphic nucleotides at a given site on a
chromosome.

As used herein, the term "genotype" is defined as the
combination of two alleles in a given individual.

As used herein, the term "sequence-specific
amplification" refers to PCR using sequence-specific primers that
only produce amplicon(s) when the target sequence is present.
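
The idea of sequence-specific amplification can be illustrated in silico with a deliberately simplified model in which a product is formed only if both primers find a perfect match on the template. The Python sketch below is a toy model under that assumption; actual specificity depends on the primer 3' end and on the reaction conditions, and the function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence containing only A, C, G and T."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def amplicon_size(template: str, forward: str, reverse: str):
    """Size in bp of the predicted product, or None if a primer has no perfect match."""
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None  # forward primer does not anneal: no amplicon
    reverse_site = reverse_complement(reverse)
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None  # reverse primer does not anneal downstream: no amplicon
    return end + len(reverse_site) - start
```

In this model a single nucleotide difference within a primer binding site abolishes the product, which mirrors the presence or absence scoring of bands used for allele assignment.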

The following examples are given for the purpose of 
illustrating various embodiments of the invention and are not 
meant to limit the present invention in any fashion: 

EXAMPLE 1

Subjects from 2 established cohorts were studied: one
in Kigali, Rwanda, and the other in New York City.
Sociodemographic characteristics and epidemiological findings
have been described elsewhere for these cohorts (20-23). The
Kigali cohort (20,21) included 202 seropositive women with
adequate follow-up since 1986. One of these subjects did not yield
complete genotyping data and was excluded from further analysis.
The cohort additionally included 77 seronegative women.
Categorization of subjects was based on their clinical outcomes in
combination with laboratory findings as defined by the Kigali
staging system (21). Three subgroups of Kigali subjects were
established: 15 rapid progressors (RPs) who died from AIDS in <6
years; 100 slow progressors (SPs) who remained asymptomatic for
>10 years; and a third group in which subjects that could not be
classified definitively based on disease progression were treated
as indeterminates (n=53) and combined with intermediate
progressors (n=33) who had less extreme clinical outcome in the
period of 6-10 years after seroconversion.
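
The time thresholds used for the subgroups can be summarized in a short Python sketch. This is a simplification for illustration only: the Kigali staging system itself also relies on laboratory findings and is not reproduced here, and the function and argument names are hypothetical.

```python
def classify_progression(years_to_aids_death=None, years_asymptomatic=None):
    """Assign a progression subgroup from the time thresholds given in the text."""
    if years_to_aids_death is not None and years_to_aids_death < 6:
        return "RP"  # rapid progressor: died from AIDS in <6 years
    if years_asymptomatic is not None and years_asymptomatic > 10:
        return "SP"  # slow progressor: asymptomatic for >10 years
    return "intermediate/indeterminate"


# Subgroup sizes reported above; together they account for the 201 seropositive
# women retained for analysis.
GROUP_SIZES = {"RP": 15, "SP": 100, "indeterminate": 53, "intermediate": 33}
```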

The clinical and epidemiologic categorization of the
subset of heterosexual HIV-1-positive Rwandan women available
for genetic studies has been summarized. For the 77
seronegative and the 201 seropositive women, the following risk
factors were analyzed: 1) currently living alone but having
multiple sexual partners or currently living with a single partner
but both partners having extramarital affairs; 2) history of sexually
transmitted disease in the past five years; 3) having multiple
lifetime sexual partners; and 4) no condom use during sex in the
last 2 years.

The Bronx, New York cohort consisted of subsets of
injecting drug users recruited from three ethnic groups, including
58 Caucasians, 135 Hispanics, and 35 African-Americans (22,23).
From the New York cohort of injecting drug users, 141 individuals
were selected who remained seronegative despite high risk (based
on a combination of the following non-exclusive risk behaviors)
and 87 who became infected despite relatively little such risk or
exposure: 1) having sex with another injecting drug user; 2)
sharing needles in shooting galleries; 3) attending shooting
galleries; and 4) sharing needles with strangers. Men having sex
with other men was another risk factor observed in the overall
New York cohort, but not in the subset of subjects studied here.
Studies in both cohorts conformed to the procedures for informed
consent approved by local and/or sponsoring institutional review
boards.
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FYAMPf.Fl 

Primati' spp.riftS 

Non-human primates, including 2 chimpanzees (Pan 
troglodytes), 2 pig-tailed macaques {Macaca nemestrina), and 2 
5 sooty raangabeys (Cercocebus atys) were studied for their CCR5 
promoter polymorphisms. Their CCR5 promoter sequences were 
amplified by PGR and analyzed by automated sequencing as for 
human samples. 

EXAMPLE 3

Genotyping materials

DNA samples used throughout the study were extracted from whole
buffy coats or precipitates of cervicovaginal fluids, using the
standard salting out procedures (24,25) and QIAamp blood kit
(QIAGEN Inc., Chatsworth, Calif., USA), respectively.

EXAMPLE 4

Genotyping procedures

Genotyping of CCR5-Δ32 by PCR amplification size polymorphism and
of CCR2b-64I by PCR-RFLP was performed as described (1,4). The
CCR5-Δ32 and CCR2b-64I sites are approximately 2 kb downstream and
12.7 kb upstream, respectively, from the CCR5 downstream promoter
that spans nucleotide 59052G (in the extended loci as defined by
GenBank sequence U95626) to 59530C (13).

Typing of CCR5 promoter variants was initially achieved through
automated sequencing of PCR-amplified products corresponding to
nucleotide position 59012G to 59943G. Briefly, the design of
allele-specific PCR amplification primers took advantage of the
dimorphic site at nucleotide position 59029 (relative to GenBank
accession number U95626), 20 bp upstream from the first nucleotide
of the minimum CCR5 promoter segment (14). Two separate PCR
reactions were performed for each sample: the first using the
59029G-specific forward primer (nucleotides 59012G->59029G) and
a common reverse primer (CCR5P-COM3N, 59925G<-59943G), the second
using the 59029A-specific forward primer (59012G->59029A) and
primer CCR5P-COM3N.

PCR conditions were optimized to allow sequence-specific
amplification determined by the 59029G/A-specific primers: each
12.5 µl PCR consisted of 1X buffer C (60 mM Tris-HCl (pH 8.5),
15 mM (NH4)2SO4, 2.5 mM MgCl2), 0.3 units of AmpliTaq polymerase,
together with 0.5 µM of each primer, 80 ng genomic DNA, and 0.2 mM
each of dCTP, dGTP, dTTP and dATP. The PCR mix was subjected to
10 cycles of denaturing at 95°C for 25 sec, annealing at 59°C for
40 sec, and extension at 72°C for 50 sec, followed by 23 cycles of
denaturing at 95°C for 25 sec, annealing at 55°C for 40 sec, and
extension at 72°C for 50 sec. Samples yielding a 932-bp amplicon
were diluted 1:40 in TE buffer (10 mM Tris-HCl (pH 8.0), 2 mM
EDTA) and 1.0 µl of the diluted product served as template for a
50-µl second-round PCR with a biotinylated reverse primer
(CCR5P-COMR, 59709T<-59729T) and a forward M13 (18 mer)-tailed
primer (M13-CCR5P, 59012G->59028G).

After a further 22 cycles of denaturing at 95°C for 25 sec,
annealing at 59°C for 40 sec, and extension at 72°C for 50 sec,
the resulting products were bound to streptavidin-coated combs,
and single-stranded templates were generated and sequenced using
a Cy5-labelled universal M13 primer (5'-ACA GGA AAC AGC TAT
GAC-3'; SEQ ID No. 1), a Cy5-labelled internal primer (CCR5PSBT,
59291T->59310A), and the ALFexpress autoload sequencing
procedures (Pharmacia Biotech, Inc., Piscataway, NJ, USA). The
allelic sequencing strategy outlined above resolved all known
polymorphic sites in the constitutive downstream CCR5 promoter
region without cloning and its attendant problems due to
PCR-introduced nucleotide substitution (26). The reliability of
data generated by automated sequencing was further confirmed by
sequencing the complementary strand.

To expedite the genotyping procedures, 15 pairs of
sequence-specific primers (SSP) were used to type the 5 major
CCR5 promoter alleles simultaneously with the CCR5 and CCR2b
variants (Tables 1 & 2). The final conditions for SSP-based typing
were as follows: 1) each 10 µl PCR solution contained 1X buffer C,
50-70 ng of genomic DNA, 0.3 units of AmpliTaq polymerase
(Fisher Scientific, Norcross, GA, USA), 120 nM of each control
primer, 250 nM of each specific primer, 0.4 mM each of dGTP,
dCTP, dTTP and dATP, 10% (v/v) glycerol, and 0.02% cresol red;
2) PCR cycle reactions were performed on an UNO-thermoblock
(Biometra, Inc., Tampa, FL, USA) for 10 higher-stringency cycles
of denaturing at 94°C for 25 sec, annealing at 60°C for 45 sec,
and extension at 72°C for 45 sec, followed by 21 additional
lower-stringency cycles of denaturing at 94°C for 25 sec,
annealing at 57°C for 40 sec, and extension at 72°C for 40 sec;
3) all PCR reactions were done in 96-well microtiter plates, each
capable of typing 6 individual samples; 4) before each cycle
reaction, PCR began with a denaturing step at 95°C for 2.5 min;
5) all PCR reactions included a final extension step at 72°C for
6 min; 6) half of each PCR reaction product was loaded directly
onto a 1.7% agarose gel for electrophoresis. The SSP-banding
patterns were recorded on photographs of ethidium
bromide-stained gels.

The use of PCR-SSP was limited to the identification of known
genetic variants; novel alleles that may exist in different
populations could either give rise to novel reaction patterns or
be misclassified by the scheme used herein. The 67 samples from
Kigali and New York typed by both sequencing and PCR-SSP
demonstrated 100% consistency between the 2 methods (Table 3).
Thus, new alleles were not encountered even when the number of
genotyped samples increased.






TABLE 1

Oligo primers used for typing major polymorphisms in CCR2b, CCR5
and the CCR5 downstream promoter region

Oligo name    Specificity*     5'->3' sequence            Annealing position            SEQ ID No.

CCR2b-5/1S    CCR2b-64V        tgggcaacatgctggtcg         46278->46295                  2
CCR2b-5/2S    CCR2b-64I        tgggcaacatgctggtca         46278->46295; 61619->61636    3
CCR2b-3/1G    general          tggaaaataagggccacagac      46670<-46690                  4
CCR5P-5/1S    59029G           gagtggagaaaaaggggg         59012->59029                  5
CCR5P-5/2S    59029A           gagtggagaaaaagggga         59012->59029                  6
CCR5P-3/1S    59353T;59356C    agaatagatctctggtctgaaa     59353<-59374                  7
CCR5P-3/2S    59353C           agaatagatctctggtctgaag     59353<-59374                  8
CCR5P-3/3S    59353T;59356T    gagaatagatctctggtctaaaa    59353<-59375                  9
CCR5P-3/4S    59356C           tagagaatagatctctggtctg     59356<-59377                  10
CCR5P-3/5S    59356T           tagagaatagatctctggtcta     59356<-59377                  11
CCR5P-3/6S    59402G           agaatcagagaacagttcttcc     59402<-59423                  12
CCR5P-3/7S    59402A           agaatcagagaacagttcttct     59402<-59423                  13
CCR5P-5/3S    59653C           caggaaacccatagaagac        59635->59653                  14
CCR5P-5/4S    59653T           caggaaacccatagaagat        59635->59653                  15
CCR5P-3/8G    general          gtgggcacatattcagaag        59925<-59943                  16
CCR5-SP4G     general          tcattacacctgcagctctc       62004->62023                  17
CCR5-PM6G     general          tggtgaagataagcctcac        62182<-62200                  18

* The numbering of nucleotide positions is based upon GenBank
sequence U95626.
TABLE 2

Primer mixes used in PCR-SSP-based typing of major polymorphisms
in the CCR2b, CCR5 and the CCR5 promoter

Primer mixes                       Specificity*         Size of PCR amplicon

1a = CCR2b-5/1S + CCR2b-3/1G       CCR2b-64Val          [illegible] bp
1b = CCR2b-5/2S + CCR2b-3/1G       CCR2b-64Ile, CCR5    [illegible] bp
2a = CCR5P-5/1S + CCR5P-3/1S       [illegible]          [illegible] bp
2b = CCR5P-5/1S + CCR5P-3/2S       59029G-59353C        [illegible] bp
2c = CCR5P-5/2S + CCR5P-3/1S       [illegible]          [illegible] bp
2d = [illegible]                   [illegible]          [illegible] bp
2e = CCR5P-5/1S + CCR5P-3/3S       [illegible]          [illegible] bp
3a = CCR5P-5/1S + CCR5P-3/6S       [illegible]          412 bp
3b = CCR5P-5/1S + CCR5P-3/7S       59029G-59402A        412 bp
3c = CCR5P-5/2S + CCR5P-3/6S       [illegible]          [illegible] bp
3d = CCR5P-5/2S + CCR5P-3/7S       59029A-59402A        412 bp
4a = CCR5P-5/3S + CCR5P-3/8G       59653C               309 bp
4b = CCR5P-5/4S + CCR5P-3/8G       59653T               309 bp
5a = CCR5P-5/1S + CCR5P-3/4S       59029G-59356C        367 bp
5b = CCR5P-5/1S + CCR5P-3/5S       59029G-59356T        367 bp
Control = CCR5-SP4G + CCR5-PM6G    CCR5 Δ32             165 bp
                                   CCR5 WT              197 bp

* The numbering of nucleotide positions is based upon GenBank
sequence U95626.
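
The amplicon sizes quoted for the sequencing and SSP reactions can be cross-checked against the primer coordinates in Table 1. The following is a minimal sketch only, assuming that each product runs from the first base of the forward primer to the last base of the reverse primer on the U95626 numbering; the function name is illustrative and not part of the original protocol.

```python
def amplicon_length(forward_start: int, reverse_end: int) -> int:
    """Length in bp of a product spanning forward_start..reverse_end inclusive."""
    if reverse_end < forward_start:
        raise ValueError("reverse primer must lie downstream of the forward primer")
    return reverse_end - forward_start + 1


# Coordinates as listed in Table 1 (GenBank U95626 numbering).
first_round = amplicon_length(59012, 59943)   # 59029-specific forward + common reverse
mix_3b = amplicon_length(59012, 59423)        # CCR5P-5/1S + CCR5P-3/7S
mix_4a = amplicon_length(59635, 59943)        # CCR5P-5/3S + CCR5P-3/8G
control_wt = amplicon_length(62004, 62200)    # CCR5-SP4G + CCR5-PM6G, wild type
control_del = control_wt - 32                 # same primers on the 32-bp deletion allele

print(first_round, mix_3b, mix_4a, control_wt, control_del)
```

Under this assumption the computed lengths agree with the 932-bp first-round product and with the 412-, 309-, 197- and 165-bp products listed in Table 2.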
TABLE 3

Extended CCR5 downstream promoter alleles obtained by allelic
sequencing of PCR products derived from 67 selectively sequenced
samples

Allele           n             59029         Combinations of                      59653      SEQ ID
designations*                  sequence      polymorphic sites**                  sequence   No.

CCR5P*0101       [illegible]   G             T59353-C59356-G59402-G59648 (=P4)    C          24
CCR5P*0102       [illegible]   G             T59353-C59356-A59402-G59648 (=P2)    C          25
CCR5P*0103       [illegible]   G             T59353-T59356-A59402-G59648 (=P3)    C          26
CCR5P*0104       [illegible]   [illegible]   [illegible]-A59402-A59648 (=P2)      C          27
CCR5P*0201       [illegible]   A             C59353-C59356-A59402-G59648 (=P1)    C          28
CCR5P*0202       56            A             C59353-C59356-A59402-G59648 (=P1)    T          29

* Allelic sequences P*0101, P*0102, P*0103, P*0201 and P*0202
have been deposited in GenBank with Accession numbers AF109379,
AF109380, AF109381, AF109382 and AF109383, respectively. Allele
P*0104 was only observed once. ** The numbering of nucleotide
positions is based on GenBank sequence U95626. Polymorphism at
position 59029 corresponds to the classification of an earlier
study (5). This region has also been studied (6) and the
alternative allele nomenclature is shown in parentheses.

EXAMPLE 5

Linkage analysis of genetic variants at the CCR2b, CCR5 and CCR5
promoter loci

Linkage between allelic variants at different loci is typically
estimated with data from families (27), which were not available
in the cohorts studied herein. Instead, "apparent" 2-locus linkage
was determined by χ² tests for a 2 x 2 table containing the
numbers of individuals with both, one without the other, or
neither of the 2 markers in question (28). A p value documenting
a significant association of two alleles (i.e., suggesting linkage
disequilibrium) did not eliminate the possibility of their
coincident occurrence on opposite chromosomes. Conversely, fully
reliable two-locus haplotypes could be established only when
alleles at one of the two loci were homozygous, and only such
haplotypes were accepted for analysis.
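
The acceptance rule for two-locus haplotypes can be made concrete with a short sketch. It assumes unphased genotypes are supplied as pairs of allele labels per locus; the function name and data layout are illustrative assumptions, not part of the original procedure.

```python
from typing import List, Optional, Tuple

Genotype = Tuple[str, str]


def resolve_haplotypes(locus1: Genotype, locus2: Genotype) -> Optional[List[Tuple[str, str]]]:
    """Return the two haplotypes when phase is unambiguous, otherwise None.

    Phase is unambiguous only when at least one locus is homozygous;
    a double heterozygote may carry its alleles in cis or in trans.
    """
    a1, a2 = locus1
    b1, b2 = locus2
    if a1 == a2 or b1 == b2:
        return [(a1, b1), (a2, b2)]
    return None  # double heterozygote: excluded from haplotype analysis


# Homozygous at the first locus: both haplotypes are determined.
print(resolve_haplotypes(("64I", "64I"), ("P*0202", "P*0101")))
# Heterozygous at both loci: phase cannot be assigned without family data.
print(resolve_haplotypes(("64V", "64I"), ("P*0101", "P*0202")))
```

Returning None for double heterozygotes mirrors the text: such individuals contribute to the 2 x 2 association test but not to the set of accepted haplotypes.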



EXAMPLE 6

Phylogenetic analyses of CCR5 promoter sequences

The phylogenetic relationships among allelic sequences from human
and non-human primate species were analyzed using software in the
PHYLIP (29) and PAUP (30) packages. All CCR5 promoter sequences
were aligned manually, with gaps introduced to increase alignment
of sequences from primates. The final data set contained sequences
spanning from nucleotide position 59029 (in the extended loci as
defined by GenBank sequence U95626) to 59940.



EXAMPLE 7

Genotyping of CCR2b-64I by PCR-RFLP and PCR with
sequence-specific primers (SSP)

The CCR2b-64 polymorphisms were determined by PCR-RFLP based on
an amplification-created restriction site in the 129-bp fragment
(4) corresponding to nucleotide positions 46275 to 46403 (relative
to U95626). The CCR2b-64V-bearing sequence could be digested by
BsaBI into two fragments of 18 bp and 111 bp, while the sequence
carrying CCR2b-64I was refractory to digestion. To differentiate
CCR2b-64V from CCR2b-64I, PCR-SSP was used in two separate PCR
reactions with both control and sequence-specific primer sets.
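
The RFLP read-out described above can be sketched as a lookup of expected band sizes per genotype. This is an illustration under the stated fragment sizes (129 bp uncut; 18 bp and 111 bp after BsaBI digestion of the 64V-bearing product); the function name and genotype labels are assumptions.

```python
from typing import List

AMPLICON_BP = 129           # undigested PCR product
CUT_FRAGMENTS = (18, 111)   # BsaBI fragments of the 64V-bearing product


def expected_bands(genotype: str) -> List[int]:
    """Expected band sizes (bp) after BsaBI digestion for 'V/V', 'V/I' or 'I/I'."""
    bands = set()
    for allele in genotype.split("/"):
        if allele == "V":
            bands.update(CUT_FRAGMENTS)   # 64V product is cut
        elif allele == "I":
            bands.add(AMPLICON_BP)        # 64I product is refractory to digestion
        else:
            raise ValueError(f"unknown allele: {allele}")
    return sorted(bands)


for g in ("V/V", "V/I", "I/I"):
    print(g, expected_bands(g))
```

A heterozygote is expected to show all three bands, which is why the uncut 129-bp band alone identifies a 64I homozygote.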



EXAMPLE 8

Genotyping of CCR5-Δ32 by PCR amplification size polymorphism

PCR amplification of the CCR5 coding sequences corresponding to
nucleotide positions 62004-62183 (relative to U95626)
differentiated the CCR5-Δ32 and wild type alleles (1), producing
a 180-bp amplicon for the wild type sequence and a 148-bp product
for the deletion sequence. Additional primers were also used to
amplify the deletion region for sequencing and size
differentiation in a PCR-SSP format.



EXAMPLE 9

Simultaneous genotyping of CCR2b, CCR5, and CCR5 promoter
polymorphisms based on PCR-SSP

To facilitate simultaneous PCR-SSP-based typing of the
dimorphisms CCR2b-64V/I, CCR5 wild type/CCR5-Δ32, and CCR5
promoter alleles P*0101, P*0102, P*0103, P*0201 and P*0202, a
panel of 21 individual primers was selected based upon a) similar
melting temperatures; b) no mispriming; c) no strong secondary
structure; d) no homodimer/heterodimer formation. The first 12
primer mixes were adequate for the routine genotyping of each
sample. Three additional primer mixes were required to confirm
samples carrying CCR5P*0103, which gave weak positive signals in
reaction 2a due to the presence of 59356T. Samples whose CCR5
promoter variants were genotyped by sequencing and whose CCR2b
dimorphism was genotyped by PCR-RFLP (4) served as reference
materials for testing the reliability of the PCR-SSP assay.
Different thermostable DNA polymerases, including AmpliTaq
(Promega, Madison, WI, USA) and Platinum Taq (Life Technologies,
Gaithersburg, MD, USA), yielded comparable data sets, showing
100% consistency between PCR-SSP and sequencing and between
PCR-SSP and RFLP.

Using the PCR-SSP format, the CCR5 genotype could be assigned
according to the size of the SSP control product amplified by
primers flanking the 32-bp deleted region. A confirmatory product
was also available in SSP reaction 1b: primer CCR2b-5/2S (also
CCR2b-64I-specific) and primer CCR5-PM6G (CCR5-general) in this
multiplex reaction amplify a larger fragment spanning the 32-bp
deletion region in CCR5. As a result, the undeleted wild type
CCR5 sequence gave rise to a 197 bp product in all 12 reactions
as well as a 585 bp product in reaction 1b, while the CCR5-Δ32
genotype produced 165 bp and 553 bp amplicons in the
corresponding reactions. The CCR5-specific product from reaction
1b was also tested by RFLP for the presence of the CCR5-m303
polymorphism (45), which was uncommon or absent in other
populations (1,43,48) as well as in the New York and Rwandan
subjects tested here.
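
The size-based assignment of the CCR5 coding-region genotype can be illustrated as follows. This is a hypothetical sketch: the band-matching tolerance of 5 bp and the genotype labels are assumptions introduced here, and only the 197-bp and 165-bp control product sizes come from the text.

```python
WT_BP = 197            # control product from the undeleted CCR5 sequence
DEL_BP = WT_BP - 32    # 165-bp control product from the CCR5-Δ32 allele


def call_ccr5_genotype(band_sizes, tolerance=5):
    """Assign a CCR5 genotype from the control-band sizes (bp) seen on the gel."""
    has_wt = any(abs(b - WT_BP) <= tolerance for b in band_sizes)
    has_del = any(abs(b - DEL_BP) <= tolerance for b in band_sizes)
    if has_wt and has_del:
        return "WT/d32"
    if has_wt:
        return "WT/WT"
    if has_del:
        return "d32/d32"
    return "no call"   # control product missing: reaction should be repeated


print(call_ccr5_genotype([197]))        # wild type homozygote
print(call_ccr5_genotype([165, 197]))   # heterozygote
print(call_ccr5_genotype([165]))        # deletion homozygote
```

The same logic would apply to the confirmatory products of reaction 1b by substituting 585 bp and 553 bp for the two constants.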

EXAMPLE 10

Statistical analysis

Kigali and New York subjects were analyzed separately; New York
subjects belonged to three ethnic groups and, as such, were
analyzed separately and jointly. Standard procedures in EpiInfo
software (33) were used to calculate χ² statistics and odds
ratios with 95% confidence intervals in order to assess the
differential distribution of genetic factors in subjects with
varying rates of HIV-1 disease progression, to measure apparent
linkage disequilibrium among CCR variants, and to evaluate
associations of genetic and non-genetic risk factors with HIV-1
infection. Multiple logistic regression analysis by SAS (SAS
Institute, Inc., Cary, NC) was performed by incorporating 1) all
CCR and SDF1 variants; 2) ethnicity and age of participants; and
3) individual risk factors with significant associations with
HIV-1 infection (Kigali) or individual risk factors used for
selecting participants (New York). Multivariate analyses were
also performed using a summary risk score, but this reduced model
was less informative. Yates' corrected method or a two-tailed
Fisher's exact test was applied to analyses with small numbers of
observations.

EXAMPLE 11

Identification and frequency of CCR5 promoter and other related
markers

Analyses yielded 6 different alleles and 15 genotypes in the
cohort of Rwandan women, 9 of which were seen at a frequency
higher than 5% (Table 4). Direct allele-level sequencing of the
CCR5 promoter region spanning nucleotide 59029 to 59653 of
PCR-amplified human genomic DNA revealed 6 CCR5 promoter alleles:
five were common and one was observed in only a single individual
(Table 3). The 5 common alleles (not including the rare
polymorphism at position 59648) could theoretically produce 15
genotypes (allele combinations), 14 of which were observed in the
60+ samples examined herein. The 5 major allelic variants were
defined by dimorphic sites at nucleotide positions 59029, 59353,
59356, 59402, and 59653 relative to GenBank sequence U95626. By
this definition, allele sequences P*0101 and P*0201 matched
GenBank sequences U95626 and AF031237, respectively.
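
The figure of 15 theoretical genotypes follows from counting unordered allele pairs, homozygotes included, which for n alleles is n(n+1)/2. A brief sketch of the enumeration (the helper name is illustrative):

```python
from itertools import combinations_with_replacement

MAJOR_ALLELES = ["P*0101", "P*0102", "P*0103", "P*0201", "P*0202"]


def possible_genotypes(alleles):
    """All unordered allele pairs, homozygotes included, e.g. 'P*0101/*0201'."""
    return [f"{a}/{b[1:]}" for a, b in combinations_with_replacement(alleles, 2)]


genotypes = possible_genotypes(MAJOR_ALLELES)
print(len(genotypes))   # n(n+1)/2 with n = 5
print(genotypes[:3])
```

Adding the rare allele P*0104 to the list raises the theoretical count to 21, although only one genotype involving it was observed.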
The distributions of CCR2b, CCR5, and CCR5 promoter variants
differed widely among distinct ethnic groups (Figure 3). Overall,
6 major ('major' is defined herein as having a frequency higher
than 5%) CCR5 promoter genotypes were detected in Hispanics, 7 in
Caucasian-Americans, 9 in African-Americans, and 10 in Rwandans
(Figure 4). Several genotypes were apparently restricted to
certain ethnic groups, but the small number of subjects in the
African-American and Caucasian-American groups did not allow
reliable comparisons among populations surveyed here. Hispanic
and non-Hispanic Caucasians resembled each other rather closely,
and black African-Americans in New York and Rwandans also
resembled each other in the distributions of promoter genotypic
frequencies (Figure 4). In particular, allele P*0103 was largely
restricted to populations of African origin. As a result, there
were more CCR5 promoter genotypes (allele combinations) in
Africans than in Caucasians and Hispanics. Multiple linkages
among CCR2b, CCR5, and CCR5 promoter variants were consistent in
all populations (Tables 5 & 6). For example, exclusive linkage of
promoter allele P*0202 (59653T) to CCR2b-64I was consistent with
an earlier observation in Caucasians (2). Meanwhile, promoter
allele P*0201 was the only variant that was linked to CCR5-Δ32.
Mutually exclusive linkages with P*0201 and P*0202 implied that
CCR2b-64I and CCR5-Δ32 could not exist on the same chromosome,
and the negative linkage between CCR2b-64I and CCR5-Δ32, as
revealed by the analysis herein, confirmed this association (or
lack thereof) (Table 5).

Sequences from non-human primates revealed differences in 2 of 6
putative transcription factor binding elements within the CCR5
promoter (Figure 5A). As would be predicted, pair-wise genetic
distance (Figure 5B) and phylogenetic analyses (Figure 5C) placed
the chimpanzee CCR5 promoter sequence closer to all human CCR5
promoter alleles than to alleles from other primate species. The
ancestral lineage of human CCR5 promoter alleles appeared to be
P*0102, which shared identical sequences with other non-human
alleles at nucleotide positions that define all major promoter
alleles in humans (i.e., 59029, 59353, 59356, 59402, and 59653).
Moreover, P*0201 and P*0202 apparently diverged from the common
ancestral sequence more recently than did other alleles. By the
same analyses, polymorphisms at two positions, the G at 59029 and
the T at 59353, appeared ancestral to any other lineages.

Finally, in all human and non-human alleles, exclusive linkage of
the G-to-A nucleotide substitution at position 59029 to the
T-to-C change at position 59353 presumably indicates either gene
conversion or simultaneous point mutations. If the former is
true, other upstream (CCR2b coding region) or downstream (CCR5
coding region) polymorphisms are likely to be linked in the same
fashion. The CCR2b-64I-59653T linkage served as one example, and
the CCR5-Δ32-P*0201 linkage served as another. Thus, a disease
association with an individual polymorphic site can actually
signal relationships with one or more additional polymorphisms at
multiple inter-linked positions.
TABLE 4

CCR2b and CCR5 promoter genotypes observed in HIV-1-infected
Rwandan women

Promoter genotypes    Expected frequency (%)    Observed frequency (%)

P*0101/*0101          1.2                       3.0
P*0101/*0102          7.9                       6.0
P*0101/*0103          1.8                       0.0
P*0101/*0201          4.2                       4.5
P*0101/*0202          7.9                       6.0
P*0102/*0102          6.8                       12.4
P*0102/*0103          4.3                       0
P*0102/*0201          4.9                       6.0
P*0102/*0202          18.4                      15.4
P*0103/*0103          0.7                       2.5
P*0103/*0201          3.1                       2.5
P*0103/*0202          5.8                       9.0
P*0104/*0202          0.2                       0.5
P*0201/*0201          3.6                       5.5
P*0201/*0202          13.3                      13.9
P*0202/*0202          12.5                      12.9

CCR2b genotypes

64V/I                 40.8                      42.3
64I/I                 8.2                       7.5
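
Table 4 does not state how the expected frequencies were derived. The sketch below assumes they are Hardy-Weinberg proportions (p² for homozygotes, 2pq for heterozygotes) computed from allele frequencies obtained by counting alleles in the observed genotypes; that assumption, the function names, and the shortened allele labels are not taken from the original.

```python
from collections import defaultdict

# Observed promoter genotype frequencies (%) as listed in Table 4.
OBSERVED = {
    ("0101", "0101"): 3.0, ("0101", "0102"): 6.0, ("0101", "0103"): 0.0,
    ("0101", "0201"): 4.5, ("0101", "0202"): 6.0, ("0102", "0102"): 12.4,
    ("0102", "0103"): 0.0, ("0102", "0201"): 6.0, ("0102", "0202"): 15.4,
    ("0103", "0103"): 2.5, ("0103", "0201"): 2.5, ("0103", "0202"): 9.0,
    ("0104", "0202"): 0.5, ("0201", "0201"): 5.5, ("0201", "0202"): 13.9,
    ("0202", "0202"): 12.9,
}


def allele_frequencies(genotype_pct):
    """Allele frequencies (0..1) by allele counting; each genotype carries two alleles."""
    total = sum(genotype_pct.values())
    freq = defaultdict(float)
    for (a, b), pct in genotype_pct.items():
        freq[a] += pct / (2 * total)
        freq[b] += pct / (2 * total)
    return dict(freq)


def expected_pct(freq, a, b):
    """Hardy-Weinberg expected genotype frequency in percent."""
    p, q = freq[a], freq[b]
    return 100 * (p * p if a == b else 2 * p * q)


freq = allele_frequencies(OBSERVED)
for pair in [("0102", "0102"), ("0102", "0202"), ("0201", "0202")]:
    print(pair, round(expected_pct(freq, *pair), 1))
```

Under this assumption the computed values fall close to the Expected column (for example about 6.8% for P*0102/*0102 and about 18.4% for P*0102/*0202), with small differences attributable to rounding in the published percentages.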
TABLE 5

Linkage among CCR2b, CCR5 and CCR5 promoter variants in subjects
from the Bronx, New York^a

Variant 1    Variant 2        N^b    1+/2+   1+/2-   1-/2+   1-/2-   OR      95% CI       p

CCR2b-64I    CCR5-Δ32         193    1       53      11      128     0.22    0.03-1.74    0.118
CCR2b-64I    CCR5 promoter
             P*0101           228    20      42      109     57      0.25    0.13-0.46    0.001
             P*0102           228    10      52      46      120     0.50    0.24-1.07    0.071
             P*0201           228    23      39      99      67      0.40    0.22-0.73    0.002
             P*0202^c         228    62      0       7       159     23.7    11.5-49.0    0.001
             59029G^d         228    37      25      138     28      0.3     0.16-0.58    0.001
CCR5-Δ32     P*0201           193    12      0       97      84      1.87    1.63-2.14    0.002

^a Subjects from the Bronx included Caucasians (n=58), Hispanics
(n=135), and African-Americans (n=35). ^b African-Americans
(n=35) are not included in the analysis involving CCR5-Δ32, as
this variant is not present in this ethnic group. ^c Promoter
allele P*0202 exclusively carries 59653T (Table 3). ^d 59029G is
exclusively linked to 59353T, and both occur at equal frequency
in this cohort regardless of the ethnic group.
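
The odds ratios in Table 5 can be reproduced from the four cell counts. The sketch below is illustrative only: the text says that standard EpiInfo procedures were used, and the log-based (Woolf) confidence interval shown here is an assumption about the method, chosen because it matches the tabulated limits for the rows checked.

```python
import math


def odds_ratio_ci(both, only1, only2, neither, z=1.96):
    """Odds ratio and Woolf (log-based) 95% confidence interval for a 2 x 2 table."""
    cells = (both, only1, only2, neither)
    if min(cells) == 0:
        raise ValueError("zero cell: use a continuity correction or an exact method")
    odds_ratio = (both * neither) / (only1 * only2)
    se = math.sqrt(sum(1.0 / c for c in cells))   # standard error of ln(OR)
    lower = math.exp(math.log(odds_ratio) - z * se)
    upper = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, lower, upper


# CCR2b-64I versus promoter allele P*0101 in the Bronx cohort (Table 5):
# 1+/2+ = 20, 1+/2- = 42, 1-/2+ = 109, 1-/2- = 57.
or_, lower, upper = odds_ratio_ci(20, 42, 109, 57)
print(f"OR = {or_:.2f}, 95% CI {lower:.2f}-{upper:.2f}")
```

Rows containing a zero cell, such as P*0202 or the CCR5-Δ32/P*0201 comparison, cannot be handled by this simple formula, which is why the function refuses them rather than guessing at the correction used in the table.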
TABLE 6

Linkage analysis of CCR2b-64I and individual CCR5 promoter
alleles and polymorphic sites in a cohort of 278 Rwandan women^a

Variant 1    Variant 2            1+/2+   1+/2-   1-/2+   1-/2-   OR       95% CI          p

CCR2b-64I    Promoter variants
             P*0101               15      115     38      110     0.4      0.20-0.73       0.003
             P*0102               33      97      87      61      0.2      0.14-0.40       0.001
             P*0201               33      97      73      75      0.4      0.21-0.58       0.001
             P*0202               129     1       19      129     875.8    115.5-6639.7    0.001
             59029G^b             67      63      126     22      0.2      0.10-0.33       0.001

^a CCR5-Δ32 is absent in this cohort. ^b 59029G is exclusively
linked to 59353T, and both occur at equal frequency in this
cohort.



EXAMPLE 12

CCR5 promoter genotypes and HIV-1 infection

Seropositive and seronegative individuals consistently differed
in their distributions of two CCR5 promoter genotypes: the
homozygous genotype P*0101/*0101 was more frequent in
seropositives than in seronegatives (Table 7), while homozygous
or heterozygous genotypes involving allele P*0201 were more
frequent in seronegatives than in seropositives (Table 7).
Despite the tight linkages that existed among alleles of CCR2b,
the CCR5 coding region, and the CCR5 promoter, the relationships
between CCR5 promoter alleles and occurrence of HIV-1 infection
could not be attributed to either CCR2b or variants in the CCR5
coding region (Table 8). Homozygosity and heterozygosity for
SDF1-3'A, a polymorphism in the gene encoding the natural ligand
of CXCR4 (54), were further excluded as a contributing factor.

Several CCR5 promoter genotypes showed a trend toward association
in either of the combined groups, probably as a result of
ethnically specific effects. For example, the homozygous promoter
form P*0103/*0103 was found in 6 (or 2.8%) HIV-1-infected black
individuals but was absent from seronegatives (OR = 5.8, p =
0.097). This relationship was as strong as that recognized earlier
for the P*0101/*0101 genotype.

The promoter allele P*0103 was largely restricted to blacks.
Thus, the protective genotypes involving P*0201 could be split
into 5 major genotypes (P*0101/*0201, P*0102/*0201, P*0103/*0201,
P*0201/*0201, and P*0201/*0202) in black populations, but
genotype P*0101/*0103 was not detected. In the Caucasoid group,
P*0201 can form 4 genotypes, and one of these (P*0102/*0201) was
rare. The re-analyses revealed that only genotypes P*0201/*0201
and P*0201/*0202 were associated with protection against
infection in Caucasoids (OR = 0.5, p = 0.032), compared with
P*0102/*0201, P*0103/*0201, and P*0201/*0201 in blacks (OR = 0.3,
p = 0.001). Another P*0201+ genotype, P*0101/*0201, was equally
distributed in seropositive and seronegative groups regardless of
their ethnicity. Therefore, the relative contribution to
protection by different genotypes involving allele P*0201 varied
according to the ethnic background.

The relationships observed for CCR5 promoter genotypes remained
the same when two additional non-CCR factors known to influence
the rates of HIV-1 disease progression in this cohort were
considered. Specifically, homozygosity at HLA class I A or B or
both was strongly associated with more rapid disease progression
in these Rwandan women, while HLA B*57 was associated with slower
disease progression. Data from combinations of promoter and HLA
factors strongly suggest that the protective effects of promoter
genotypes P*0101/*0201 and P*0101/*0202 are independent of the
HLA factors, as were the risk effects of P*0201/*0201 and
P*0102/*0202 (Table 10).

The characterization of CCR5 promoter allelic sequences provided
simple explanations for previous associations with disease
progression (Figure 6) and allowed further predictions. First,
alleles P*0201 and P*0202 were the only promoter variants
carrying the previously recognized disease-accelerating P1/P1
genotype (6). These two alleles carried exclusively 59029A and
not the disease-delaying 59029G/G (5). As a result, P1/P1
homozygotes and 59029G/G-carriers represented 2 mutually
exclusive groups of promoter genotypes. The association of the
P1/P1-carrying P*0201/*0201 with more rapid disease progression
as revealed by our data supported and refined the known effects
of P1/P1.

Second, as noted above, P*0202 (59653T) is exclusively linked to
CCR2b-64I (Tables 5 & 6), which has been associated with slower
disease progression in cohorts of seroconverters (2,4,11,12).
Thus, the relationship between the P1/P1 genotype and rapid HIV-1
disease progression must depend upon P*0201 and not the
CCR2b-64I-linked P*0202 (Figure 6B). The lack of association
between CCR2b-64I and slower disease progression in Rwandan women
was seemingly inconsistent with findings from another African
cohort (34). However, exclusive linkage of 59653T to CCR2b-64I
(2), which in turn solely defines promoter allele P*0202, made
these findings easily explainable. In other earlier studies, the
CCR2b-64I variant has been less clearly associated with slower
disease progression in certain seroprevalent groups than in
cohorts of seroconverters (2,4,10-12), perhaps as much because of
their genetic heterogeneity as because of misclassification with
respect to outcome. The contrasting relationships seen here with
the CCR2b-64I-carrying P*0101/*0202 (protective) and
P*0201/*0202 (risk) indicated clearly that the CCR2b-64I effect
can go either way depending upon the promoter genotypes. As a
result, the CCR2b-64I effect must be secondary to polymorphisms
in the promoter, which can vary significantly among different
ethnic groups. The association between the P1/P1 variant and an
accelerated course independent of CCR2b-64I (6), coupled with the
preliminary demonstration of reduced promoter activity (5) in
carriers of the 59029G variant that includes P*0101 and P*0102,
enhances the likelihood that the effects of promoter
polymorphisms are independent of CCR2b-64I.
TABLE 7

Distribution of CCR5 promoter genotypes in ethnic Africans and
ethnic Caucasians

Promoter        Ethnic Africans                            Caucasoids
genotypes       HIV+^a        HIV-          OR (p)         HIV+         HIV-         OR (p)
                (n=213)       (n=100)                      (n=75)       (n=118)

P*0101/*0101    3.3 (7)^b     0             6.8 (0.065)    21.3 (16)    7.6 (9)      3.1 (0.010)
P*0101/*0102    [illegible]   [illegible]   -              12.0 (9)     9.3 (11)     -
P*0101/*0201    4.7 (10)      6.0 (6)       -              25.3 (19)    28.8 (34)    -
P*0101/*0202    5.6 (12)      5.0 (5)       -              8.0 (6)      12.7 (15)    -
P*0102/*0102    12.2 (26)     10.0 (10)     -              0            1.7 (2)      -
P*0102/*0103    0.5 (1)       4.0 (4)       0.1 (0.038)    1.3 (1)      0            -
P*0102/*0201    5.6 (12)      19.0 (19)     0.3 (0.001)    4.0 (3)      3.4 (4)      -
P*0102/*0202    15.0 (32)     10.0 (10)     -              8.0 (6)      2.5 (3)      3.3 ([illegible])
P*0103/*0103    2.8 (6)       0             5.8 (0.097)    0            0            -
P*0103/*0201    2.8 (6)       8.0 (8)       0.3 (0.042)    2.7 (2)      0            -
P*0103/*0202    9.9 (21)      4.0 (4)       2.6 (0.054)    1.3 (1)      3.4 (4)      -
P*0201/*0201    5.2 (11)      7.0 (7)       -              6.7 (5)      15.3 (18)    0.4 (0.073)
P*0201/*0202    13.1 (28)     12.0 (12)     -              9.3 (7)      14.4 (17)    -
P*0202/*0202    12.2 (26)     6.0 (6)       2.2 (0.092)    0            0.8 (1)      -

^a One of the infected individuals in this group carried the rare
genotype P*0104/*0202. ^b Numbers are expressed as % (n).
TABLE 9

Differential distribution of CCR5 promoter genotypes in HIV-1+
Rwandan women with varying rates of disease progression and a
comparison with previously recognized promoter variants and the
closely related CCR2b genotypes

                                 RPs^a       IPs^a       SPs^a       p for
Observed genotypes               (N = 15)    (N = 86)    (N = 100)   trend

CCR5 promoter genotypes^b defined here
1. P*0101/*0201                  0           2.3 (2)^c   6.9 (7)     0.083
2. P*0101/*0202                  0           1.2 (1)     10.9 (11)   0.005
1 + 2                            0           3.5 (3)     17.8 (18)   0.001
3. P*0101/*0102^d                13.3 (2)    5.8 (5)     5.0 (5)     0.318
4. P*0101/*0103^d: not found     -           -           -           -
5. P*0101/*0101^d                6.7 (1)     1.2 (1)     4.0 (4)
3 + 4 + 5 = 3 + 5^d              20.0 (3)    7.0 (6)     9.0 (9)
1 + 2 + 3 + 4 + 5 =
  1 + 2 + 3 + 5^d                20.0 (3)    10.5 (9)    27.0 (27)   0.030
6. P*0102/*0202                  26.7 (4)    18.6 (16)   11.0 (11)   0.058
7. P*0201/*0201                  13.3 (2)    7.0 (6)     3.0 (3)     0.072
6 + 7                            40.0 (6)    25.6 (22)   14.0 (14)   0.007
8. P*0202/*0202                  6.7 (1)     14.0 (12)   13.0 (13)

Promoter genotypes recognized previously
59029G/G^e                       26.7 (4)    20.9 (18)   26.0 (26)
59029A/A^f = P1/P1               33.3 (5)    34.9 (30)   30.0 (30)
59029G/A                         40.0 (6)    44.2 (38)   43.6 (44)

CCR2b genotypes
CCR2b-64I/V + 64I/I              53.3 (8)    50.0 (43)   49.0 (49)
CCR2b-64I/V                      46.6 (7)    43.0 (37)   41.0 (41)
CCR2b-64I/64I                    6.7 (1)     7.0 (6)     8.0 (8)

^a The three groups of patients (see text) were classified according
to their rates of disease progression: RPs = rapid progressors =
deaths from AIDS in <6 years; SPs = slow progressors = no clinical
manifestation of HIV-1-related diseases in >10 years; IPs =
intermediate and indeterminant progressors with less extreme clinical
outcomes. ^b Fifteen major CCR5 promoter genotypes involving 5 major
alleles were defined in this cohort. Only those involving alleles
that appeared to distribute disproportionally (p ≤ 0.1) are shown.
^c Numbers are expressed as % (n). ^d All genotypes carry homozygous
59029G/G. ^e 59029G/G (5) are represented by 6 major genotypes in
this cohort: P*0101/*0101, P*0101/*0102, P*0101/*0103, P*0102/*0102,
P*0102/*0103, P*0103/*0103; none of them appeared to have biased
distribution among the three groups of subjects. ^f The P1 genotype
(6) consists of 2 distinct alleles (*0201 and *0202) in this cohort
as defined by the polymorphism at nucleotide position 59653 (GenBank
U59626).
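
The "p for trend" column of Table 9 can be illustrated with a short sketch. The specification does not say which trend test was used, so the Cochran-Armitage test below is an assumption, as are the function name and the ordinal scores 0, 1, 2 assigned to RPs, IPs and SPs. The counts are those of the combined "1 + 2" row.

```python
import math


def cochran_armitage_trend(cases, totals, scores=None):
    """Two-sided Cochran-Armitage test for a linear trend in proportions.

    cases[i]  = number of carriers in ordered group i
    totals[i] = number of subjects in ordered group i
    scores[i] = ordinal score of group i (defaults to 0, 1, 2, ...)
    """
    if scores is None:
        scores = list(range(len(totals)))
    n_total = sum(totals)
    p_bar = sum(cases) / n_total  # pooled carrier proportion
    # Score-weighted excess of observed carriers over the pooled expectation.
    t_stat = sum(s * (r - n * p_bar) for s, r, n in zip(scores, cases, totals))
    s1 = sum(n * s for s, n in zip(scores, totals))
    s2 = sum(n * s * s for s, n in zip(scores, totals))
    variance = p_bar * (1 - p_bar) * (s2 - s1 * s1 / n_total)
    z = t_stat / math.sqrt(variance)
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Carriers of P*0101/*0201 or P*0101/*0202 among RPs, IPs and SPs (Table 9).
z, p = cochran_armitage_trend(cases=[0, 3, 18], totals=[15, 86, 100])
print(f"z = {z:.2f}, two-sided p = {p:.4f}")
```

A positive z here means the carrier proportion rises from rapid to slow progressors; other trend tests or score choices would give somewhat different p-values.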
TABLE 10

Analyses of combinations of disease-related factors specifically
recognized in HIV-1 seropositive Rwandan women

Promoter       Presence or absence of        RPs        IPs         SPs         p for
genotypes      other recognized factors      %(n)       %(n)        %(n)        trend

1 + 2^a        + Hmz^b at A, B, or both      0          0           2.0 (2)     0.193
               + Htr^b at A, B, or both      0          3.5 (3)     16.0 (16)   0.002
               + B*57                        0          0           3.0 (3)     0.109
               - B*57                        0          3.5 (3)     15.0 (15)   0.009
               - Hmz^b - B*57                0          3.5 (3)     13.0 (13)   0.010

P*0201/*0201   + Hmz^b at A, B, or both      6.7 (1)    1.2 (1)     0           0.037
               + Htr^b at A, B, or both      6.7 (1)    5.8 (5)     3.0 (3)     0.327
               + B*57                        0          1.2 (1)     0           0.500
               - B*57                        13.3 (2)   5.8 (5)     3.0 (3)     0.096
               - Hmz^b - B*57                6.7 (1)    4.7 (4)     3.0 (3)     0.378

P*0102/*0202   + Hmz^b at A, B, or both      6.7 (1)    7.0 (6)     3.0 (3)     0.250
               + Htr^b at A, B, or both      20.0 (3)   11.6 (10)   8.0 (8)     0.155
               + B*57                        0          3.5 (3)     2.0 (2)     0.934
               - B*57                        26.7 (4)   15.1 (13)   9.0 (9)     0.045
               - Hmz^b - B*57                20.0 (3)   10.5 (9)    6.0 (6)     0.070

^a 1 + 2 = P*0101/*0201 and P*0101/*0202, both involving the
59029G-carrying P*0101. ^b Hmz, homozygosity; Htr, heterozygosity.



EXAMPLE 13

Mediation of HIV-1 infection by CCR5 promoter genotypes is
independent of effects by non-genetic factors
Age, ethnicity and sexual and drug-using habits usually alter the
risk of acquiring HIV-1 infection. For example, among heterosexual
women in Kigali, the observed high risk factors included: 1)
currently living alone but having multiple sexual partners or
currently living with a single partner but both partners having
extra-marital affairs (OR = 1.8, p = 0.024); 2) history of sexually
transmitted disease in the past five years (OR = 1.5, p = 0.022); 3)
having multiple life-time sexual partners (OR = 1.7, p = 0.004); and
4) no condom use during sexual intercourse in the last 2 years (OR =
8.6, p = 0.0001). Risk factor 1 was strongly correlated with risk
factor 2 among the Rwandan women (χ² = 20.3, p = 0.001). Accordingly,
the more strongly related risk factor 2 was chosen for additional
analyses. For injecting drug users in New York, high risk factors
included: 1) having sex with IV drug users (OR = 2.6, p = 0.039); 2)
sharing needles in shooting galleries (OR = 1.3, p = 0.006); 3)
attending shooting galleries (OR = 1.5, p = 0.065); and 4) sharing
needles with strangers (OR = 1.01, p = 0.052).

Univariate analysis revealed that patterns of behavior (with regard
to sex in Kigali and both sex and drug injection in New York)
represented the strongest risk of getting infected. An ordinal
scoring scheme summarizing these behavioral factors demonstrated a
strong association with risk in the Kigali cohort, but not in the New
York cohort, suggesting that the latter group was strongly biased as
a result of selective enrollment. Stratification of subjects by
behavioral risk score revealed that CCR5 promoter allele P*0201
conferred more stable protection on subjects with the lowest risk
score than on those with higher scores in Kigali (high risk OR = 0.4,
95% CI = 0.0-7.2; low risk OR = 0.4, 95% CI = 0.2-0.7). In contrast,
individuals with intermediate to higher risk scores were better
protected by P*0201 in New York (high risk OR = 0.7-0.9; low risk OR
= 0.3-0.6) (Table 11). The risk associated with the P*0101/*0101
genotype diminished in the New York cohort.

To assess the simultaneous and independent genetic and non-genetic
effects, variables for age, behavioral risk, ethnicity and the two
contributing CCR5 promoter genotypes (P*0101/P*0101 and P*0201) were
incorporated into logistic regression analyses performed on separate
data from the Kigali and New York populations (Table 11). Age and
risk score consistently served as independent determinants of HIV-1
infection in both cohorts. In Kigali, CCR5 promoter genotypes
involving P*0201 were associated with decreased risk; in New York,
the P*0101/*0101 genotype was associated with increased risk. Modest
and inconsistent interaction between age and risk score and between
ethnicity and CCR5 promoter variants did not contribute significantly
to risk in the regression model. The two genotypes associated with
contrasting outcomes of infection in the two cohorts are mutually
exclusive of each other (Table 3), and their reciprocal effects are
in agreement with apparent differences in their frequencies in the
two populations (Figure 4). Thus, it became evident that associations
of the two promoter genotypes P*0101/P*0101 and P*0201 with higher
and lower risk of HIV-1 infection, respectively, were independent of
the effects of demographic and behavioral factors.
TABLE 11

Univariate and multivariate regression analyses showing the
independent and relative contribution to HIV-1 infection by genetic
and non-genetic factors in two cohorts

I. Logistic model for univariate analyses of the New York data set

Factors considered                          OR     (95% CI)      p
Having sex with IV drug user                2.6    (1.0-6.3)     0.039
Sharing needles in shooting galleries       1.3    (1.1-1.6)     0.006
Attending shooting galleries                1.5    (1.0-2.4)     0.065
Sharing needles with strangers              1.0    (1.0-1.0)     0.052
Age                                         0.9    (0.9-0.9)     0.005
Ethnicity
  Caucasian Americans                       0.6    (0.3-1.1)     0.110
  African Americans                         0.8    (0.4-1.8)     0.609
  Hispanics                                 1.7    (1.0-2.9)     0.073
CCR2b-64I                                   0.9    (0.5-1.7)     0.840
CCR5-Δ32                                    1.7    (0.5-5.3)     0.390
CCR5 promoter P*0101/*0101                  3.6    (1.5-8.4)     0.004
CCR5 promoter P*0201                        0.5    (0.3-0.9)     0.020
SDF1-3'A                                    1.5    (0.8-2.8)     0.173

II. Multivariate analyses of the New York data set

HIV-1 infection = 1.60 + 0.31 x (sharing needles) - 0.06 x age - 0.83
x (being white) + 1.41 x (CCR5 promoter P*0101/*0101)

Sharing needles                 OR = 1.4, CI = 1.1-1.7, p = 0.003
Age                             OR = 0.9, CI = 0.9-1.0, p = 0.014
Being white                     OR = 0.4, CI = 0.2-0.9, p = 0.021
CCR5 P*0101/*0101               OR = 4.1, CI = 1.7-10.2, p = 0.002

III. Logistic model for univariate analyses of the Kigali data set

Factors considered                          OR     (95% CI)      p
Venereal disease in past 5 years            2.5    (1.2-5.1)     0.016
Condom use 2 yrs prior to seroconversion    0.02   (0.0-0.1)     0.000
Number of lifetime sexual partners          1.5    (1.09-2.0)    0.011
Age                                         0.9    (0.83-0.9)    0.000
CCR2b-64I                                   1.6    (0.91-2.6)    0.108
CCR5 promoter P*0101/*0101                  999.0  (0-999.0)     0.900
CCR5 promoter P*0201                        0.4    (0.3-0.7)     0.001
SDF1-3'A                                    1.35   (0.65-2.81)   0.417

IV. Multivariate analyses of the Kigali data set

HIV-1 infection = 7.36 + 1.18 x (venereal disease) - 4.54 x (condom
use) - 0.09 x age - 0.87 x (all CCR5 promoter genotypes involving
P*0201)

Venereal disease in past 5 years            OR = 3.2, CI = 1.4-7.7, p = 0.007
Condom use 2 yrs prior to seroconversion    OR = 0.02, CI = 0.0-0.1, p = 0.000
Age                                         OR = 0.9, CI = 0.8-1.0, p = 0.024
CCR5 promoter P*0201                        OR = 0.4, CI = 0.2-0.8, p = 0.011
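
The link between the regression equations and the odds ratios listed in Table 11 can be sketched as follows. This is a minimal illustration, assuming the left-hand side of each equation is the log-odds (logit) of HIV-1 infection, as is usual for logistic regression. The dictionary keys, function names and the example covariate values (including the 0/1 coding and the age of 30) are hypothetical; the specification does not state how the covariates were coded.

```python
import math

# Coefficients copied from the New York multivariate equation in Table 11.
NEW_YORK_MODEL = {
    "intercept": 1.60,
    "sharing_needles": 0.31,
    "age": -0.06,
    "being_white": -0.83,
    "ccr5_p0101_p0101": 1.41,
}


def odds_ratio(beta):
    """Odds ratio for a one-unit increase in a covariate with coefficient beta."""
    return math.exp(beta)


def predicted_probability(model, covariates):
    """Predicted probability of infection, assuming a logit link."""
    logit = model["intercept"] + sum(
        model[name] * value for name, value in covariates.items()
    )
    return 1.0 / (1.0 + math.exp(-logit))


for name, beta in NEW_YORK_MODEL.items():
    if name != "intercept":
        print(f"{name}: OR = {odds_ratio(beta):.1f}")

# Hypothetical subject profiles differing only in P*0101/*0101 carriage.
base = {"sharing_needles": 1, "age": 30, "being_white": 0, "ccr5_p0101_p0101": 0}
carrier = dict(base, ccr5_p0101_p0101=1)
p_base = predicted_probability(NEW_YORK_MODEL, base)
p_carrier = predicted_probability(NEW_YORK_MODEL, carrier)
print(f"non-carrier: {p_base:.2f}, carrier: {p_carrier:.2f}")
```

Exponentiating each New York coefficient gives the tabulated odds ratios to one decimal place. The predicted probabilities depend entirely on the assumed covariate coding and should be read only as an illustration of the model form.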



EXAMPLE 14

The observed relationships between CCR5 promoter polymorphisms and
HIV-1 infection differed from the previously recognized associations

To compare previously reported relationships of CCR5 promoter
sequences to progression with their relationships to infection seen
here, the CCR5 promoter alleles were further aggregated into groups
as defined by earlier methods. The effects of the P*0202-linked
CCR2b-64I variant were also tested. By this re-analysis, the 59029G/G
variant previously associated with slower disease progression and the
P1/P1 genotype previously associated with faster disease progression
separated mutually exclusive subsets of the promoter genotypes,
explaining their opposing effects. However, the previously recognized
relationships between these two CCR5 promoter variants and HIV-1
disease progression almost contradicted their relationships to
infection. For example, the 59029G/G genotype (= all combinations
involving only P*0101, P*0102, P*0103, and P*0104) was previously
associated with slower disease progression (5). In the population
cohorts described herein, P*0101/P*0101 (a subset of 59029G/G) showed
an unequivocal relationship to increased risk of infection (Table 8).
Here, differences in population allele frequencies are unlikely to be
a sufficient explanation.

The effects of the P1/P1 genotype differed even between the two
cohorts: in Kigali, the P1/P1 genotype was associated with becoming
infected, whereas in New York, the same genotype was associated with
remaining uninfected. The effects of CCR2b-64I were the least stable
when the 4 ethnic groups were analyzed separately. These apparently
self-contradictory results contrast starkly with the strength and
consistency of the relationships established by the typing scheme
used herein, which had a definitive resolution of the promoter
genotypes (Table 3). Thus, the manifestation of CCR relationships
appeared to be sensitive to ethnicity and allele definitions.

Summary

Intensive effort has been devoted to measuring the influence of
polymorphisms in CCR5 and its promoter on the rates of disease
progression during HIV-1 infection. The almost complete absence of
HIV-1 infection in CCR5-Δ32 homozygotes has highlighted the
importance of this co-receptor for initial penetration by the virus.
It is conceivable that reduced production of CCR5 mediated by the
promoter genotypes (5) can further determine the infectability by
HIV-1. This aspect has heretofore escaped much of the attention
surrounding the CCR5 promoter polymorphisms.
Based on earlier data derived from various ethnic groups, the four
disease-related promoter alleles P*0101, P*0102, P*0201 and P*0202
differ in their frequencies in different populations. In particular,
the promoter allele P*0201 occurs more frequently in Caucasoids
(Caucasians and Hispanics, 34%) than in populations of African origin
(22%). If the P*0201/*0201 genotype (a subset of P1/P1) truly
mediates more rapid progression to disease following HIV-1 infection,
then a higher proportion of rapid progressors can be attributed to
this genotype in Caucasians (11-12%) than in Africans (4-5%). It
would, therefore, be more difficult to detect the effect of the
P*0201/*0201 or P1/P1 genotype in populations of African origin. The
lack of association between P1/P1 and rapid disease progression in
African-Americans (6) reflects this difficulty, but a better
resolution of the P1/P1 genotype is likely to pinpoint the exact
alleles involved.
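
The homozygote percentages quoted above are consistent with squaring the P*0201 allele frequencies. The sketch below makes that arithmetic explicit, assuming Hardy-Weinberg proportions; the specification does not state how the 11-12% and 4-5% figures were derived, and the function name is illustrative.

```python
def expected_homozygote_fraction(allele_frequency):
    """Expected homozygote frequency under Hardy-Weinberg proportions (p squared)."""
    return allele_frequency ** 2


# P*0201 allele frequencies quoted in the text.
for population, frequency in [("Caucasoid", 0.34), ("African origin", 0.22)]:
    fraction = expected_homozygote_fraction(frequency)
    print(f"{population}: {100 * fraction:.1f}% expected P*0201/*0201")
```

Under this assumption the expected P*0201/*0201 fractions are 0.34 x 0.34 = 0.1156 and 0.22 x 0.22 = 0.0484, which fall inside the quoted ranges.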

Despite the consistent associations between CCR5 promoter variants
and contrasting rates of HIV-1 disease progression, the mechanisms
for these effects are still puzzling. For example, none of the
polymorphic promoter sites map within any of the 6 putative
transcription factor-binding (TFB) elements (13,14), suggesting that
other unknown neighboring factors in linkage disequilibrium are
probably involved.
In summary, the work described herein confirmed the association of
CCR5 genotype P1/P1 and further indicated that only the subset
represented by the P*0201/*0201 genotype was the true factor in the
Rwandan population studied. Meanwhile, two promoter genotypes,
P*0101/*0201 and P*0101/*0202, collectively and individually showed
the opposite effects. The appearance and strength of the association
of CCR2b-64I obviously depended on the relative frequencies of 2
competing CCR5 promoter genotypes, P*0102/*0202 and P*0201/*0202,
both involving the CCR2b-64I-carrying P*0202 allele. In addition, the
disease-accelerating effect of P*0201/*0201 is predicted to be more
significant to Caucasian and Hispanic populations, whose P*0201
frequencies are higher than those of populations of African origin.

The consistent effects associated with CCR5 promoter alleles in
ethnically diverse populations studied herein are less dramatic
compared with those associated with CCR5-Δ32 and CCR5-m303. It is
unlikely that a simple mechanism can explain all of the observed
relationships. The existing data pool already points to a complex
phenomenon that depends on a combination of differences in promoter
activities (5), in splicing of CCR5 transcription products (13), and
in the relative prevalence of M- and T-tropic quasispecies at
different stages of HIV-1 infection (40,55). There is also limited
evidence that the co-receptor usage by HIV-1 may be governed not only
by CCR5 expression (37,52), but also by the ratio between CCR5
coreceptor and CD4+ on the cell surface (19). If the relative
expression of CCR5 and CD4+ is altered after infection, the effects
exerted by a promoter genotype at the inception of HIV-1 infection
could appear very different from those seen at later stages.

There are three single nucleotide changes between the promoter
alleles P*0101 and P*0201; however, none of them occurs in any of the
six putative transcription factor-binding elements. If the different
effects associated with these two alleles have anything to do with
differential transcription of CCR5, there are probably additional
unrecognized elements in the promoter region. Meanwhile, two of the
three nucleotide changes between CCR5 promoter alleles P*0101 and
P*0201 are located in the untranslated region (exon 1) of CCR5 mRNA.
These sites (59353 and 59402) may alter translation rather than
transcription. The relatively high frequency of promoter alleles
P*0101 and P*0201 in different ethnic groups suggested that these
factors can be more significant than the two Caucasian-specific
factors in the CCR5 locus previously shown to mediate HIV-1
transmission.






WO 01/12857                                          PCT/US00/22255



TABLE 12

Allele assignment for CCR2b, CCR5 and CCR5 promoter alleles based
upon PCR-SSP

Genotypes         DNA band(s) visualized      Notes
                  in SSP reaction

CCR2b-64V         1a=413 bp(++++)             [illegible] 1997)

CCR2b-64I         1b=413 bp(++++)             1b always has 4b=309 bp(++++)

CCR5 wild type    1b=585 bp(+++)              197 bp control product

CCR5-Δ32          1b=553 bp(+++)              165 bp control product

P*0101            2a=363 bp(++++);
                  3a=412 bp(++++);            extra 5a=367 bp(++++)
                  4a=309 bp(++++)

P*0102            2a=363 bp(++++);
                  3b=412 bp;                  extra 5a=367 bp
                  4a=309 bp(++++)

P*0103            2a=363 bp(+/-);
                  3b=412 bp(++++);            extra 2e=363 bp(++++),
                  4a=309 bp(++++)             5b=367 bp(++++)

P*0201            2d=363 bp(++++);
                  3d=412 bp(++++);            NA
                  4a=309 bp(++++)

P*0202            2d=363 bp(++++);            4b(+++) usually has
                  3d=412 bp(++++);            1b=413 bp(++++)
                  4b=309 bp(+++)

The intensity of specific products is measured as: ++++ very strong;
+++ strong; ++ clearly visible; + weak; +/- weak or invisible. See
also Figure 2.
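
The allele assignments in Table 12 amount to a lookup from a pattern of positive SSP reactions to a pair of promoter alleles. The sketch below is one illustrative way to express that lookup. The reaction labels (2a, 3a, and so on) are as read from Table 12, band intensities are ignored, the weak (+/-) 2a product of P*0103 is treated as absent, and the dictionary and function names are hypothetical rather than part of the described method.

```python
from itertools import combinations_with_replacement

# Positive SSP reactions expected for each promoter allele, as read from
# Table 12 (simplified: intensities ignored, "+/-" treated as absent).
ALLELE_REACTIONS = {
    "P*0101": {"2a", "3a", "4a", "5a"},
    "P*0102": {"2a", "3b", "4a", "5a"},
    "P*0103": {"2e", "3b", "4a", "5b"},
    "P*0201": {"2d", "3d", "4a"},
    "P*0202": {"2d", "3d", "4b"},
}


def call_promoter_genotypes(observed_reactions):
    """Return every allele pair whose combined reactions equal the observed set."""
    observed = set(observed_reactions)
    matches = []
    for first, second in combinations_with_replacement(sorted(ALLELE_REACTIONS), 2):
        if ALLELE_REACTIONS[first] | ALLELE_REACTIONS[second] == observed:
            matches.append((first, second))
    return matches


# A sample positive in reactions 2a, 3a, 4a, 5a, 2d, 3d and 4b.
print(call_promoter_genotypes({"2a", "3a", "4a", "5a", "2d", "3d", "4b"}))
```

Matching on the union of both alleles' reactions, rather than testing each allele separately, keeps P*0101/*0202 distinct from P*0101/*0201, whose patterns differ only in reaction 4b. An empty result would indicate a pattern this simplified table does not cover.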



EXAMPLE 15

CCR5 Genotypes Determine Progression of HIV-1 Infection by
Regulating Early Viral Load

Up to 306 mostly Caucasian homosexual HIV-1 seroconverters from the
Multicenter AIDS Cohort Studies and District of Columbia Gay Cohort
had viral load (VL) measurements 6-36 months after seroconversion.
Men were typed by multiplexing PCR with sequence-specific primers
that simultaneously defined markers such as: CCR2b-64I and
cis-regulatory dimorphisms G303A, T627C, C630T, A676G, C927T
(including the P1/P1 pairing reported in association with rapid
progression), and Δ32 in CCR5 (Figure 7). Viral load (Chiron or
Roche) was reported as log10. The GLM procedure in SAS was used to
model the relationships.

Age-, race-, cohort-adjusted mean VL at a median of about 15 months
was 4.33. Overall, Δ32 carriers showed clearly lower VL (4.06,
p=0.008), 64I showed only a modest reduction (VL=4.21) and P1/P1
showed no reduction (VL=4.38). In a multivariate model, compared with
the reference group (n=224), men who carried Δ32 in the absence of
the homozygous A-C-C-A-C haplotype (a subset of P1/P1) had lower VL
both in the presence (3.38, p=0.03) and the absence (4.01, p=0.001)
of the 64I-linked A-C-C-A-C haplotype. Conversely, men who carried
the homozygous A-C-C-A-C haplotype in the absence of Δ32 had higher
VL (4.52, p=0.05). The overall adjusted model for association of CCR
genotypes with VL was highly predictive (p=0.0005).
CCR5 genotype HHE/HHE (Figure 9) contributed the majority of the
effect on VL and disease progression, causing >2-year acceleration to
the onset of AIDS in 10% of the participants. The infrequent CCR5
genotype carrying Δ32 (HHG*2) and 64I (HHF*2) had a strong protective
effect (lower VL and longer time to AIDS) in 2% of the participants.
CCR2-64I alone conferred slight but not significant protection
against HIV-1 VL or disease progression.

In conclusion, comprehensive genotyping of CCR2b and CCR5 regulatory
and coding region variants coupled with analysis accounting for
linkage disequilibria between the variants indicated that Δ32, the
64I-linked A-C-C-A-T CCR5 promoter haplotype, and the homozygous
A-C-C-A-C genotype collectively determine later disease progression
partly by governing viral load early in infection.
Any patents or publications mentioned in this specification are
indicative of the levels of those skilled in the art to which the
invention pertains. Further, these patents and publications are
incorporated by reference herein to the same extent as if each
individual publication was specifically and individually indicated to
be incorporated by reference.

One skilled in the art will appreciate readily that the present
invention is well adapted to carry out the objects and obtain the
ends and advantages mentioned, as well as those objects, ends and
advantages inherent herein. The present examples, along with the
methods, procedures, treatments, molecules, and specific compounds
described herein are presently representative of preferred
embodiments, are exemplary, and are not intended as limitations on
the scope of the invention. Changes therein and other uses will occur
to those skilled in the art which are encompassed within the spirit
of the invention as defined by the scope of the claims.
WHAT IS CLAIMED IS:

1. A method of surveying CCR genotypes in a population, comprising
the steps of:

(a) obtaining biological samples from a representative number of
individuals in a population, wherein each sample is from a different
individual, wherein said sample comprises genomic DNA;

(b) combining a portion of each sample with an experimental primer
combination and a control primer combination, thereby producing
primer-annealed DNA, wherein said experimental primer combination is
selected from the group consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 &
7, 5 & 8, 5 & 9, 5 & 10, 5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 &
12, 6 & 13, 14 & 16 and 15 & 16, wherein said control primer
combination is SEQ ID Nos. 17 & 18;

(c) amplifying said primer-annealed DNA in a reaction, thereby
producing amplicons, wherein reaction conditions for said
amplification are optimized for sequence-specific amplification,
wherein each experimental primer combination and said control primer
combination are predicted to produce one or more amplicons having
expected sizes in basepairs;
(d) separating said amplicons by size, wherein the presence of:

a 197 bp amplicon with said control primer combination is indicative
of a CCR5 wildtype coding sequence;

a 165 bp amplicon with said control primer combination is indicative
of a CCR5-Δ32 coding sequence;

a 363 bp amplicon with experimental primer combination SEQ ID Nos. 5
& 7, a 367 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 10, a 412 bp amplicon with experimental primer combination
SEQ ID Nos. 5 & 12 and a 309 bp amplicon with experimental primer
combination SEQ ID Nos. 14 & 16 is indicative of a P*0101 CCR5
promoter allele;

a 363 bp amplicon with experimental primer combination SEQ ID Nos. 5
& 7, a 367 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 10, a 412 bp amplicon with experimental primer combination
SEQ ID Nos. 5 & 13 and a 309 bp amplicon with experimental primer
combination SEQ ID Nos. 14 & 16 is indicative of a P*0102 CCR5
promoter allele;

a 363 bp amplicon with experimental primer combination SEQ ID Nos. 5
& 7, a 363 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 9, a 367 bp amplicon with experimental primer combination
SEQ ID Nos. 5 & 11, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 is indicative of
a P*0103 CCR5 promoter allele;

a 363 bp amplicon with experimental primer combination SEQ ID Nos. 6
& 8, a 412 bp amplicon with experimental primer combination SEQ ID
Nos. 6 & 13 and a 309 bp amplicon with experimental primer
combination SEQ ID Nos. 14 & 16 is indicative of a P*0201 CCR5
promoter allele;

a 363 bp amplicon with experimental primer combination SEQ ID Nos. 6
& 8, a 412 bp amplicon with experimental primer combination SEQ ID
Nos. 6 & 13 and a 309 bp amplicon with experimental primer
combination SEQ ID Nos. 15 & 16 is indicative of a P*0202 CCR5
promoter allele;

(e) determining a CCR genotype for each sample based upon said CCR
alleles indicated following step (d); and

(f) compiling said genotypes determined in step (e), thereby
genotyping said representative number of individuals in said
population, thereby surveying CCR genotypes in said population.

2. The method of claim 1, wherein said biological samples are
selected from the group consisting of blood, serum, saliva, semen,
tissue biopsy and DNA.

3. The method of claim 1, wherein said separation is selected from
the group consisting of gel electrophoresis and size fractionation.

4. The method of claim 1, wherein said amplifying is by polymerase
chain reaction (PCR).
5. A method of surveying HIV-1 co-receptor CCR alleles in an
individual, comprising the steps of:

(a) obtaining a biological sample from an individual, wherein said
sample comprises genomic DNA;

(b) combining a portion of said sample with at least one experimental
primer combination and a control primer combination, thereby
producing primer-annealed DNA, wherein said experimental primer
combinations are selected from the group consisting of SEQ ID Nos. 2
& 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10, 5 & 11, 5 & 12, 5 & 13, 6 &
7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and 15 & 16, wherein said control
primer combination is SEQ ID Nos. 17 & 18;

(c) amplifying said primer-annealed DNA in a reaction, thereby
producing amplicons, wherein reaction conditions for said
amplification are optimized for sequence-specific amplification,
wherein each experimental primer combination and said control primer
combination are predicted to produce one or more amplicons having
expected sizes in basepairs; and

(d) separating said amplicons by size, wherein the presence of:

a 197 bp amplicon with said control primer combination is indicative
of a CCR5 wild type coding sequence;

a 165 bp amplicon with said control primer combination is indicative
of a CCR5-Δ32 coding sequence;

a 363 bp amplicon with experimental primer combination SEQ ID Nos. 5
& 7, a 367 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 10, a 412 bp amplicon with experimental primer combination
SEQ ID Nos. 5 & 12 and a 309 bp amplicon with experimental primer
combination SEQ ID Nos. 14 & 16 is indicative of a P*0101 CCR5
promoter allele;

a 363 bp amplicon with experimental primer combination SEQ ID Nos. 5
& 7, a 367 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 10, a 412 bp amplicon with experimental primer combination
SEQ ID Nos. 5 & 13 and a 309 bp amplicon with experimental primer
combination SEQ ID Nos. 14 & 16 is indicative of a P*0102 CCR5
promoter allele;

a 363 bp amplicon with experimental primer combination SEQ ID Nos. 5
& 7, a 363 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 9, a 367 bp amplicon with experimental primer combination
SEQ ID Nos. 5 & 11, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 is indicative of
a P*0103 CCR5 promoter allele;

a 363 bp amplicon with experimental primer combination SEQ ID Nos. 6
& 8, a 412 bp amplicon with experimental primer combination SEQ ID
Nos. 6 & 13 and a 309 bp amplicon with experimental primer
combination SEQ ID Nos. 14 & 16 is indicative of a P*0201 CCR5
promoter allele;

a 363 bp amplicon with experimental primer combination SEQ ID Nos. 6
& 8, a 412 bp amplicon with experimental primer combination SEQ ID
Nos. 6 & 13 and a 309 bp amplicon with experimental primer
combination SEQ ID Nos. 15 & 16 is indicative of a P*0202 CCR5
promoter allele.



6. The method of claim 5, wherein said biological samples are
selected from the group consisting of blood, serum, saliva, semen,
tissue biopsy and DNA.

7. The method of claim 5, wherein said separation is selected from
the group consisting of gel electrophoresis and size fractionation.

8. The method of claim 5, wherein said amplifying is by polymerase
chain reaction (PCR).



9. A method of predicting the disease progression of AIDS in an
HIV-1-infected individual, comprising the steps of:

(a) obtaining a biological sample from an individual, wherein said
sample comprises genomic DNA;

(b) combining a portion of said sample with at least one experimental
primer combination and a control primer combination, thereby
producing primer-annealed DNA, wherein said experimental primer
combinations are selected from the group consisting of SEQ ID Nos. 2
& 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10, 5 & 11, 5 & 12, 5 & 13, 6 &
7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and 15 & 16, wherein said control
primer combination is SEQ ID Nos. 17 & 18;

(c) amplifying said primer-annealed DNA in a reaction, thereby
producing amplicons, wherein reaction conditions for said
amplification are optimized for sequence-specific amplification,
wherein each experimental primer combination and said control primer
combination are predicted to produce one or more amplicons having
expected sizes in basepairs;

(d) separating said amplicons by size, wherein the presence of:

a 363 bp amplicon with experimental primer combination SEQ ID Nos. 6
& 8, a 412 bp amplicon with experimental primer combination SEQ ID
Nos. 6 & 13 and a 309 bp amplicon with experimental primer
combination SEQ ID Nos. 14 & 16 is indicative of a CCR5 promoter
genotype of P*0201/P*0201;

a 363 bp amplicon with experimental primer combination SEQ ID Nos. 5
& 7, a 367 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 10, a 412 bp amplicon with experimental primer combination
SEQ ID Nos. 5 & 13, a 363 bp amplicon with experimental primer
combination SEQ ID Nos. 6 & 8, a 412 bp amplicon with experimental
primer combination SEQ ID Nos. 6 & 13, a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 15 & 16 is
indicative of a CCR5 promoter genotype of P*0102/P*0202;

a 363 bp amplicon with experimental primer combination SEQ ID Nos. 5
& 7, a 367 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 10, a 412 bp amplicon with experimental primer combination
SEQ ID Nos. 5 & 12, a 363 bp amplicon with experimental primer
combination SEQ ID Nos. 6 & 8, a 412 bp amplicon with experimental
primer combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 is indicative of
a CCR5 promoter genotype of P*0101/P*0201;

a 363 bp amplicon with experimental primer combination SEQ ID Nos. 5
& 7, a 367 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 10, a 412 bp amplicon with experimental primer combination
SEQ ID Nos. 5 & 12, a 363 bp amplicon with experimental primer
combination SEQ ID Nos. 6 & 8, a 412 bp amplicon with experimental
primer combination SEQ ID Nos. 6 & 13, a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 15 & 16 is
indicative of a CCR5 promoter genotype of P*0101/P*0202;

wherein a CCR5 promoter genotype of P*0201/P*0201 or P*0102/P*0202
is predictive of an accelerated rate of AIDS progression in said
individual relative to an individual who does not possess said
P*0201/P*0201 or P*0102/P*0202 genotype, wherein a CCR5 promoter
genotype of P*0101/P*0201 or P*0101/P*0202 is predictive of a slower
rate of AIDS progression in said individual relative to an individual
who does not possess said P*0101/P*0201 or P*0101/P*0202 genotype.

10. The method of claim 9, wherein said biological samples are
selected from the group consisting of blood, serum, saliva, semen,
tissue biopsy and DNA.

11. The method of claim 9, wherein said separation is selected from
the group consisting of gel electrophoresis and size fractionation.

12. The method of claim 9, wherein said amplifying is by polymerase
chain reaction (PCR).

15 13. A method of predicting the probability of HIV-1 

infection in an individual, comprising the steps of: 

(a) obtaining a biological sample from an individual, 
wherein said sample comprises genomic DNA; 

(b) combining a portion of said sample with at least one
experimental primer combination and a control primer combination,
thereby producing primer-annealed DNA, wherein said experimental
primer combinations are selected from the group consisting of SEQ
ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10, 5 & 11, 5 & 12, 5 &
13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and 15 & 16, wherein said
control primer combination is SEQ ID Nos. 17 & 18;

(c) amplifying said primer-annealed DNA in a reaction,
thereby producing amplicons, wherein reaction conditions for said
amplification are optimized for sequence-specific amplification,
wherein each experimental primer combination and said control
primer combination are predicted to produce one or more amplicons
having expected sizes in basepairs; and

(d) separating said amplicons by size, wherein the
presence of:

a 363 bp amplicon with experimental primer combination SEQ ID
Nos. 6 & 8, a 412 bp amplicon with experimental primer combination
SEQ ID Nos. 6 & 13 and a 309 bp amplicon with experimental primer
combination SEQ ID Nos. 14 & 16 is indicative of a CCR5 promoter
genotype of P*0201/P*0201;

a 363 bp amplicon with experimental primer combination SEQ ID
Nos. 5 & 7, a 367 bp amplicon with experimental primer combination
SEQ ID Nos. 5 & 10, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 12 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 is indicative
of a CCR5 promoter genotype of P*0101/P*0101;

wherein a CCR5 promoter genotype of P*0201/P*0201 is
predictive of a decreased probability of HIV-1 infection in said
individual relative to an individual who does not possess said
P*0201/P*0201 genotype, wherein a CCR5 promoter genotype of
P*0101/P*0101 is predictive of an increased probability of HIV-1
infection in said individual relative to an individual who does not
possess said P*0101/P*0101 genotype.
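
Step (d) of claim 13 can be read as matching the set of positive experimental reactions against two recited patterns and then looking up the associated change in infection probability. The sketch below illustrates that reading in Python. The names `HOMOZYGOUS_PATTERNS`, `INFECTION_RISK` and `predict_infection_risk` are hypothetical, and requiring an exact match (no additional positive experimental reactions) is an assumption of this sketch, since the claim recites only the presence of the listed amplicons.

```python
# Positive reactions recited in step (d) of claim 13, keyed by
# (forward SEQ ID No., reverse SEQ ID No.) with the amplicon size in bp.
HOMOZYGOUS_PATTERNS = {
    "P*0201/P*0201": {(6, 8): 363, (6, 13): 412, (14, 16): 309},
    "P*0101/P*0101": {(5, 7): 363, (5, 10): 367, (5, 12): 412, (14, 16): 309},
}

# Direction of the predicted change in HIV-1 infection probability,
# taken from the closing wherein clauses of claim 13.
INFECTION_RISK = {
    "P*0201/P*0201": "decreased",
    "P*0101/P*0101": "increased",
}


def predict_infection_risk(observed):
    """Match observed positive reactions to a recited homozygous genotype.

    `observed` maps each positive experimental primer pair to its amplicon
    size in bp. Assumption: a genotype is called only on an exact match.
    Returns (genotype, risk) or None when neither pattern matches.
    """
    for genotype, pattern in HOMOZYGOUS_PATTERNS.items():
        if observed == pattern:
            return genotype, INFECTION_RISK[genotype]
    return None
```

A sample that also amplifies with, for example, SEQ ID Nos. 5 & 7 in addition to the P*0201 reactions returns `None` here, because such a pattern suggests a heterozygote that claim 13 does not address.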

14. The method of claim 13, wherein said biological
samples are selected from the group consisting of blood, serum,
saliva, semen, tissue biopsy and DNA.

15. The method of claim 13, wherein said separation is
selected from the group consisting of gel electrophoresis and size
fractionation.

16. The method of claim 13, wherein said amplifying is
by polymerase chain reaction (PCR).

17. A method of correlating CCR genotypes with HIV-1
transmission and/or disease progression, comprising the steps of:

(a) obtaining biological samples from a representative
number of individuals, wherein each sample is from a different
individual, wherein said sample comprises genomic DNA;

(b) assessing each individual's HIV-1 status and/or risk
of acquiring HIV-1;

(c) assigning each individual to a risk group, wherein
said assignment is based upon said individual's HIV-1 status and/or
risk of acquiring HIV-1;

(d) combining a portion of each sample with at least
one experimental primer combination and a control primer
combination, thereby producing primer-annealed DNA, wherein said
experimental primer combinations are selected from the group
consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10, 5
& 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and 15 &
16, wherein said control primer combination is SEQ ID Nos. 17 & 18;

(e) amplifying said primer-annealed DNA in a reaction,
thereby producing amplicons, wherein reaction conditions for said
amplification are optimized for sequence-specific amplification,
wherein each experimental primer combination and said control
primer combination are predicted to produce one or more amplicons
having expected sizes in basepairs; and

(f) separating said amplicons by size, wherein the 
presence of: 

a 197 bp amplicon with said control primer combination
is indicative of a CCR5 wildtype coding sequence;

a 165 bp amplicon with said control primer combination
is indicative of a CCR5-Δ32 coding sequence;

a 363 bp amplicon with experimental primer combination
SEQ ID Nos. 5 & 7, a 367 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 14 & 16
is indicative of a P*0101 CCR5 promoter allele;

a 363 bp amplicon with experimental primer combination
SEQ ID Nos. 5 & 7, a 367 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 13 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 14 & 16
is indicative of a P*0102 CCR5 promoter allele;

a 363 bp amplicon with experimental primer combination
SEQ ID Nos. 5 & 7, a 363 bp amplicon with experimental primer
combination SEQ ID Nos. 5 & 9, a 367 bp amplicon with experimental
primer combination SEQ ID Nos. 5 & 11, a 412 bp amplicon with
experimental primer combination SEQ ID Nos. 5 & 13 and a 309 bp
amplicon with experimental primer combination SEQ ID Nos. 14 & 16
is indicative of a P*0103 CCR5 promoter allele;

a 363 bp amplicon with experimental primer combination
SEQ ID Nos. 6 & 8, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 14 & 16 is indicative of
a P*0201 CCR5 promoter allele;

a 363 bp amplicon with experimental primer combination
SEQ ID Nos. 6 & 8, a 412 bp amplicon with experimental primer
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with
experimental primer combination SEQ ID Nos. 15 & 16 is indicative of
a P*0202 CCR5 promoter allele;

(g) determining a CCR genotype for each sample based 
upon said CCR alleles indicated following step (f); and 

(h) analyzing said genotypes determined in step (g),
thereby genotyping said representative number of individuals with
respect to said risk group assigned each individual, thereby
correlating CCR genotypes with HIV-1 transmission and/or disease
progression.
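
Steps (f) and (g) of claim 17 can be illustrated as a two-stage call: each allele is defined by a set of positive reactions, and a diploid genotype is any pair of alleles whose combined reactions reproduce what was observed. The Python sketch below follows that reading. The function names, the use of a set union to model a heterozygote, and the interpretation of a sample showing both control bands as a wildtype/Δ32 heterozygote are assumptions of this sketch rather than statements in the claim.

```python
from itertools import combinations_with_replacement

# Allele-defining reactions from step (f) of claim 17:
# ((forward SEQ ID No., reverse SEQ ID No.), amplicon size in bp).
ALLELE_PATTERNS = {
    "P*0101": {((5, 7), 363), ((5, 10), 367), ((5, 12), 412), ((14, 16), 309)},
    "P*0102": {((5, 7), 363), ((5, 10), 367), ((5, 13), 412), ((14, 16), 309)},
    "P*0103": {((5, 7), 363), ((5, 9), 363), ((5, 11), 367),
               ((5, 13), 412), ((14, 16), 309)},
    "P*0201": {((6, 8), 363), ((6, 13), 412), ((14, 16), 309)},
    "P*0202": {((6, 8), 363), ((6, 13), 412), ((15, 16), 309)},
}


def call_promoter_genotypes(observed):
    """List every allele pair whose combined pattern equals `observed`.

    Assumption: a heterozygote shows the union of both alleles' reactions.
    More than one entry in the result means the call is ambiguous.
    """
    observed = set(observed)
    calls = []
    for a, b in combinations_with_replacement(sorted(ALLELE_PATTERNS), 2):
        if ALLELE_PATTERNS[a] | ALLELE_PATTERNS[b] == observed:
            calls.append(f"{a}/{b}")
    return calls


def call_coding_genotype(control_sizes):
    """Classify the control reaction (SEQ ID Nos. 17 & 18) by band sizes in bp."""
    sizes = set(control_sizes)
    if sizes == {197}:
        return "wildtype/wildtype"
    if sizes == {165}:
        return "delta32/delta32"
    if sizes == {165, 197}:
        # Assumed reading: both bands present indicates a heterozygote.
        return "wildtype/delta32"
    return "undetermined"
```

Under these assumptions, the combined reactions of P*0101 and P*0202 yield the single call `P*0101/P*0202`, which agrees with the seven-amplicon pattern recited for that genotype in claim 9.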

18. The method of claim 17, wherein said biological
samples are selected from the group consisting of blood, serum,
saliva, semen, tissue biopsy and DNA.

19. The method of claim 17, wherein said separation is
selected from the group consisting of gel electrophoresis and size
fractionation.

20. The method of claim 17, wherein said amplifying is
by polymerase chain reaction (PCR).

21. The method of claim 17, wherein said analyzing is
by means selected from the group consisting of two-tailed Fisher's
exact test, multiple logistic regression analysis, univariate analysis
and multivariate analyses.
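
Of the analyses named in claim 21, the two-tailed Fisher's exact test is compact enough to sketch directly. The Python example below computes it for a 2x2 table, for instance genotype carriers versus non-carriers in two risk groups. The function name and the table layout are assumptions of this sketch, and it uses the common convention of summing the probabilities of all tables that are no more likely than the observed one; the source does not say which convention was used.

```python
from math import comb


def fisher_exact_two_tailed(a: int, b: int, c: int, d: int) -> float:
    """Two-tailed Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Rows might be risk groups and columns genotype present/absent
    (a hypothetical layout). Margins are held fixed and the p-value is the
    total hypergeometric probability of every table that is no more
    probable than the observed one.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x: int) -> float:
        # Probability that the top-left cell equals x given the margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    total = 0.0
    for x in range(lowest, highest + 1):
        p = prob(x)
        # Small relative tolerance guards against floating-point ties.
        if p <= p_observed * (1 + 1e-9):
            total += p
    return min(1.0, total)
```

For the illustrative table `[[3, 1], [1, 3]]`, the tables with top-left cell 0, 1, 3 or 4 qualify, giving 34/70.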
22. An oligonucleotide selected from the group
consisting of SEQ ID Nos. 2-16.
SEQUENCE LISTING

<110> Tang, Jianming

Kaslow, Richard A.

<120> Methods of Surveying For CC (Beta) Chemokine
Receptor Variants and Their Association With HIV-1
Transmission and/or Disease Progression

<130> D6217PCT 

<140> 

<141> 2000-08-11 

<150> 60/148,530

<151> 1999-08-12 

<160> 35 

<210> 1 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primer 

<223> universal M13 primer 

<400> 1 
acaggaaaca gctatgac 18 

<210> 2 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR2b-5/1S, primer used for typing major
polymorphism in CCR2b, CCR5 and the CCR5
downstream promoter region

<400> 2 

tgggcaacat gctggtcg 18 

<210> 3 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220>

<221> primer

<223> CCR2b-5/2S, primer used for typing major
polymorphism in CCR2b, CCR5 and the CCR5
downstream promoter region

<400> 3 

tgggcaacat gctggtca 18 

<210> 4 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR2b-3/1G, primer used for typing major
polymorphism in CCR2b, CCR5 and the CCR5
downstream promoter region

<400> 4 

tggaaaataa gggccacaga c 21 

<210> 5 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR5P-5/1S, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 5 

gagtggagaa aaaggggg 18 

<210> 6 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primer 



<223> CCR5P-5/2S, primer used for typing major
polymorphism in CCR2b, CCR5 and the CCR5
downstream promoter region

<400> 6 
gagtggagaa aaagggga 18 

<210> 7 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR5P-3/1S, primer used for typing major
polymorphism in CCR2b, CCR5 and the CCR5
downstream promoter region

<400> 7 
agaatagatc tctggtctga aa 22 

<210> 8 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR5P-3/2S, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 8 
agaatagatc tctggtctga ag 22 



<210> 9 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primer 



<223> CCR5P-3/3S, primer used for typing major
polymorphism in CCR2b, CCR5 and the CCR5
downstream promoter region

<400> 9
gagaatagat ctctggtcta aaa 23

<210> 10 

<211> 22 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR5P-3/4S, primer used for typing major
polymorphism in CCR2b, CCR5 and the CCR5
downstream promoter region

<400> 10 
tagagaatag atctctggtc tg 22 

<210> 11 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primer

<223> CCR5P-3/5S, primer used for typing major
polymorphism in CCR2b, CCR5 and the CCR5
downstream promoter region

<400> 11 
tagagaatag atctctggtc ta 22 

<210> 12 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primer 

<223> primer used for typing major
polymorphism in CCR2b, CCR5 and the CCR5
downstream promoter region

<400> 12 
agaatcagag aacagttctt cc 22 

<210> 13 
<211> 22 

<212> DNA

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR5P-3/7S, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 13 
agaatcagag aacagttctt ct 22 



<210> 14
<211> 19
<212> DNA

<213> Artificial Sequence

<220>

<221> primer

<223> CCR5P-5/3S, primer used for typing major
polymorphism in CCR2b, CCR5 and the CCR5
downstream promoter region

<400> 14
caggaaaccc atagaagac 19

<210> 15
<211> 19
<212> DNA

<213> Artificial Sequence

<220>

<221> primer

<223> CCR5P-5/4S, primer used for typing major
polymorphism in CCR2b, CCR5 and the CCR5
downstream promoter region

<400> 15
caggaaaccc atagaagat 19

<210> 16 

<211> 19 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR5P-3/8G, primer used for typing major

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 16 

gtgggcacat attcagaag 19 

<210> 17 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer

<223> CCR5-SP4G, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 17 

tcattacacc tgcagctctc 20 

<210> 18 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primer 

<223> CCR5-PM6G, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 18 

tggtgaagat aagcctcac 19 

<210> 19 

<211> 5 

<212> DNA 

<213> Unknown 
<220> 

<221> gene 

<223> CCR5 promoter allele P*0101 

<400> 19 

gtcgc 

<210> 20
<211> 5
<212> DNA
<213> Unknown
<220>
<221> gene
<223> CCR5 promoter allele P*0102
<400> 20
gtcac

<210> 21
<211> 5
<212> DNA
<213> Unknown
<220>
<221> gene
<223> CCR5 promoter allele P*0103
<400> 21
gttac

<210> 22
<211> 5
<212> DNA
<213> Unknown
<220>
<221> gene
<223> CCR5 promoter allele P*0201
<400> 22
accac

<210> 23
<211> 5
<212> DNA
<213> Unknown
<220>
<221> gene
<223> CCR5 promoter allele P*0202
<400> 23
accat

<210> 24
<211> 6
<212> DNA
<213> Unknown
<220>
<221> gene
<223> CCR5 promoter allele P*0101
<400> 24
gtcggc

<210> 25
<211> 6
<212> DNA
<213> Unknown
<220>
<221> gene
<223> CCR5 promoter allele P*0102
<400> 25
gtcagc

<210> 26
<211> 6
<212> DNA
<213> Unknown
<220>
<221> gene
<223> CCR5 promoter allele P*0103
<400> 26
gttagc

<210> 27
<211> 6
<212> DNA
<213> Unknown
<220>
<221> gene
<223> CCR5 promoter allele P*0104
<400> 27
gtcaac

<210> 28
<211> 6
<212> DNA
<213> Unknown
<220>
<221> gene
<223> CCR5 promoter allele P*0201
<400> 28
accagc

<210> 29
<211> 6
<212> DNA
<213> Unknown
<220>
<221> gene
<223> CCR5 promoter allele P*0202
<400> 29
accagt



<210> 30 

<211> 24 

<212> DNA

<213> Artificial Sequence 

<220> 

<221> 

<223> 

<400> 30 
gatagattat atctggagtg aaga 24 



<210> 31

<211> 23

<212> DNA

<213> Artificial Sequence

<220>

<221>

<223>

<400> 31
atagattata tctggagtga agg 23

<210> 32 

<211> 22 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> 

<223> 

<400> 32 
aaagtctggt ctctagataa ga 22 

<210> 33 

<211> 22 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> 

<223> 

<400> 33 
gaagtctggt ctctagataa ga 22 

<210> 34 

<211> 25 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221>

<223> 

<400> 34 
ttaaaactct ttagacaaca ggttt 25 

<210> 35 
<211> 24 

<212> DNA

<213> Artificial Sequence 

<220> 

<221> 

<223> 

<400> 35 
taaaactctt tagacaacag gttg 24 
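
Several primers in the listing above come in pairs that are identical except for their final base, which is consistent with the sequence-specific amplification recited in the claims. The Python sketch below checks that property and confirms that each primer's length agrees with its `<211>` field. The pairing of primers and the reading that the 3' terminal base is what discriminates alleles are assumptions of this sketch; the listing itself only gives the sequences.

```python
# Primer sequences as printed in the listing (SEQ ID No. -> sequence);
# the grouping spaces are removed before comparison.
PRIMERS_AS_LISTED = {
    2: "tgggcaacat gctggtcg",
    3: "tgggcaacat gctggtca",
    5: "gagtggagaa aaaggggg",
    6: "gagtggagaa aaagggga",
    7: "agaatagatc tctggtctga aa",
    8: "agaatagatc tctggtctga ag",
    10: "tagagaatag atctctggtc tg",
    11: "tagagaatag atctctggtc ta",
    12: "agaatcagag aacagttctt cc",
    13: "agaatcagag aacagttctt ct",
    14: "caggaaaccc atagaagac",
    15: "caggaaaccc atagaagat",
}
PRIMERS = {k: v.replace(" ", "") for k, v in PRIMERS_AS_LISTED.items()}

# Lengths declared in the <211> fields of the listing.
DECLARED_LENGTH = {2: 18, 3: 18, 5: 18, 6: 18, 7: 22, 8: 22,
                   10: 22, 11: 22, 12: 22, 13: 22, 14: 19, 15: 19}

# Hypothetical pairing of primers assumed to target the same site.
CANDIDATE_PAIRS = [(2, 3), (5, 6), (7, 8), (10, 11), (12, 13), (14, 15)]


def differs_only_at_3prime_base(first: str, second: str) -> bool:
    """True when two primers match everywhere except the last (3') base."""
    return (len(first) == len(second)
            and first[:-1] == second[:-1]
            and first[-1] != second[-1])


def length_mismatches():
    """SEQ ID Nos. whose sequence length disagrees with the declared <211>."""
    return sorted(k for k, seq in PRIMERS.items()
                  if len(seq) != DECLARED_LENGTH[k])
```

A check like `length_mismatches()` is a cheap way to catch transcription errors when a listing has been retyped or recovered from a scan.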